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Abstract 

Current  adaptive  optical  telescope  designs  use  a  single  deformable  mirror  (DM),  usually  con¬ 
jugated  to  the  telescope  pupil,  to  compensate  for  the  cumulative  effects  of  optical  turbulence  along 
a  single  observation  direction.  The  corrected  field  of  view  (FOV)  of  an  adaptive  optics  system  could 
theoretically  be  increased  through  the  use  of  multiple  DMs  optically  conjugated  to  a  like  '-■umber  of 
corresponding  planes  which  sample  the  turbulence  region  in  altitude.  Often,  the  atmospheric  tur¬ 
bulence  responsible  for  the  degradation  of  long-exposure  telescope  images  is  concentrated  in  several 
relatively  strong  layers.  The  logical  location  for  the  conjugate  planes  in  a  multiconjugate  adaptive 
optics  (MCAO)  system  would  be  the  same  as  these  “seeing  layers.”  Each  DM  would  correct  for  the 
component  of  the  total  wavefront  in  the  pupil  contributed  by  its  corresponding  turbulent  layer.  If 
the  atmosphere  does  not  possess  a  distinctly  layered  structure,  the  best  fit  of  the  turbulence  profile 
can  be  made  to  a  layered  model,  with  the  number  of  layers  in  the  model  equal  to  the  number  of 
DMs.  This  dissertation  describes  and  analyzes  two  novel  methods  for  estimating  the  proper  DM 
surfaces  which  would  result  in  wide-FOV  compensation.  Both  methods  take  advantage  of  spatial 
diversity  in  multiple  wavefront  sensor  (WFS)  measurements  in  order  to  reconstruct  an  estimate  of 
the  three-dimensional  turbulence  structure.  The  wavefront  measurements  are  made  using  an  array 
of  artificial  guide  stars  created  by  scattered  laser  light.  The  analysis  includes  the  integrated  ef¬ 
fects  of  measurement  noise,  realistic  models  of  systems  components,  and  the  limitations  of  artificial 
guide  stars.  It  is  shown  that  multiple-DM,  multiple-guide-star  systems  can  significantly  increase 
the  compensated  FOV  relative  to  single-DM,  single-guide  star  systems. 
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INCREASING  THE  CORRECTED  FIELD  OF  VIEW 
OF  AN  ADAPTIVE  OPTICAL  TELESCOPE 

/.  Introduction 

1.1  Problem  Statement 

Nearly  40  years  after  Horace  Babcock  suggested  “the  possibility  of  compensating  astronomical 
seeing”  (2),  the  development  of  adaptive  optics  technology  has  enabled  the  production  of  binary 
star  images  which  have  been  substantially  corrected  for  atmospheric  distortion  (20,  46,  55).  We 
have  not  yet  seen,  however,  similar  high-resolution  images  of  planets,  galaxies,  or  nebulae.  The 
reason  is  that  the  high  level  of  compensation  achieved  by  adaptive  optical  telescopes  is  effective 
only  over  a  small  field  area  called  the  isoplanatic  patch.  At  visible  wavelengths,  even  in  excellent 
observing  conditions,  the  isoplanatic  patch  is  only  a  few  arcseconds  (i.e.  a  few  tens  of  microradians) 
in  diameter  (49).  Many  interesting  astronomical  objects,  however,  extend  from  several  tens  of 
arcseconds  to  arcminutes  in  apparent  size.  The  desire  to  obtain  high-resolution  images  of  these 
objects  from  ground-based  telescopes  is  one  motivation  for  solving  the  problem  addressed  in  this 
dissertation: 

Develop  a  method  for  increasing  the  corrected  field  of  view  (FOV)  of  an  adaptive  optical 
telescope  for  astronomical  imaging  and  conduct  a  performance  analysis  of  an  adaptive 
telescope  which  incorporates  this  technique. 

Adaptive  optics  literature  contains  statements  of  the  need  for  developing  a  method  for  increasing 
the  corrected  FOV,  and  informal  reports  have  advanced  heuristic  suggestions  as  to  how  to  approach 
the  problem,  but  no  detailed  work  on  corrected  FOV  widening  has  been  published.  The  result  of  the 
proposed  research  is  a  detailed  performance  analysis  of  a  new  adaptive  optics  system  design.  The 
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goal  of  the  analysis  was  to  quantify  both  the  benefits  and  the  costs  of  increasing  the  corrected  FOV'. 
as  compared  with  other  adaptive  telescope  designs  found  in  the  literature.  The  outcome  of  this 
research  includes  the  first  in-depth  contributions  in  the  area  of  adaptive  optics  FOV'  widening  (38, 
39). 

This  dissertation  is  organized  into  five  chapters.  Chapter  I  presents  the  justification  for  pur¬ 
suing  the  suggested  study  and  details  the  problem  to  be  solved.  Chapter  II  is  background  material, 
intended  to  familiarize  the  reader  with  the  current  state  of  the  art  and  summarize  information 
about  optical  systems  analysis,  atmospheric  turbulence  effects  on  imaging,  and  adaptive  optics. 
Chapter  III  contains  an  approximate  frequency  domain  analysis  of  the  FOV  widening  problem 
while  Chapter  IV,  the  main  work  of  the  dissertation,  describes  a  rigorous  spatial  domain  analy¬ 
sis.  Conclusions  and  recommendations  for  further  research  are  found  in  chapter  V.  Mathematical 
details  and  results  not  included  in  the  main  text  are  compiled  in  the  Appendices. 

l.S  Justification  for  Conducting  the  Proposed  Research 

In  1989,  astronomers  at  the  Observatoir  de  Haute  Provence,  France,  were  the  first  non¬ 
military  researchers  to  obtain  astronomical  photographic  images  corrected  to  the  diffraction  limit 
of  the  telescope  with  adaptive  optics  (55).  This  experiment  was  conducted  at  infrared  wavelengths 
(2.2-5  /im);  adaptive  optics  is  considerably  more  difficult  to  implement  in  the  visible  region  of  the 
spectrum  where  the  effects  of  atmospheric  turbulence  are  more  pronounced  and  hence,  harder  to 
correct.  Recently,  the  U.S.  Air  Force  has  revealed  the  existence  of  a  1.5-meter  adaptive  telescope 
at  the  Starfire  Optical  Range  (SOR)  in  New  Mexico,  with  which  they  have  obtained  a  number  of 
strikingly  resolved  astronomical  images  with  visible  light  (20).  Prior  to  these  May,  1991,  Air  Force 
announcements,  Welsn  and  Gardner  had  predicted  the  performance  of  an  adaptive  optical  telescope 
based  on  a  detailed  theoretical  analysis  (64).  They  show  that  the  corrected  FOV  of  this  telescope 
will  be  only  a  few  arcseconds  in  diameter.  Many  interesting  astronomical  objects,  such  as  planets 
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(47"  for  Jupiter),  nebulae  (up  to  20'),  and  the  sun  (32'),  subtend  angles  significantly  larger  than 
this  expected  FOV. 

The  military’s  interest  in  adaptive  telescopes  stems  from  intelligence  gathering  requirements 
such  as  imaging  earth-orbiting  satellites  (1).  Air  Force  researchers  at  the  SOR  list  ‘‘wide-FOV’ 
adaptive  optics”  on  their  agenda  of  future  research  (21).  Beckers  (5),  Fried  (19),  and  Hardy  (30), 
prominent  researchers  in  the  field  of  adaptive  optics,  have  written  papers  discussing  the  requirement 
for  FOV  widening  of  adaptive  optical  systems.  Although  they  suggested  ways  to  look  for  a  solution, 
neither  they  nor  any  other  researcher  has  published  a  detailed  study  of  this  problem. 

1.3  Approach 

The  small  size  of  the  compensated  field  of  an  adaptive  optical  telescope  does  not  result 
from  component  limitations  such  as  measurement  noise  but  from  a  more  fundamental  source:  the 
vertical  extent  of  the  atmospheric  turbulence  structure.  Although  atmospheric  refractive  index 
inhomogeneities,  which  are  the  source  of  astronomical  image  distortion,  form  a  three-dimensional 
spatial  random  process,  a  conventional  adaptive  optical  telescope  applies  only  a  two-dimensional 
spatial  correction  using  a  single  deformable  mirror  (DM).  The  DM  surface  is  intended  to  undo 
the  optical  phase  distortion  caused  by  the  atmosphere,  but  it  can  only  do  so  for  the  cumulative 
phase  distortion  along  one  propagation  direction.  This  direction  of  optimum  compensation  will 
correspond  to  the  location  of  the  reference  source,  or  guide  star,  which  is  used  for  the  atmospheric 
distortion  measurements.  The  correlation  between  the  surface  of  the  DM  and  the  cumulative  phase 
distortion  along  other  propagation  directions  drops  off  rapidly  with  the  offset  angle  between  the 
reference  and  observation  directions. 

One  approach  to  increase  the  corrected  field  of  view  of  an  adaptive  optical  telescope  is  to  devise 
a  method  of  applying  a  three-dimensional  correction  for  atmospheric  distortion.  An  example  of  a 
three-dimensional  phase  correction  system  would  be  a  series  of  DMs  in  the  optical  train  of  the 
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adaptive  telescope.  Dicke  was  the  first  to  suggest  this  idea  (11),  proposing  a  two-DM  system  to 
correct  for  low  and  high  altitude  contributions  to  atmospheric  distortion.  More  recently,  Beckers 
has  considered  this  concept,  coining  the  term  “multiconjugate  adaptive  optics  (MCAO).”  Beckers 
suggested  optically  conjugating  each  DM  to  a  plane  at  a  specified  distance  from  the  telescope 
aperture  (5).  Ideally,  these  conjugate  planes  would  be  located  at  altitudes  where  the  atmospheric 
turbulence  strength  is  chiefly  concentrated. 

To  illustrate  the  idea  of  MCAO,  Figure  1.1  depicts  a  schematic  comparison  between  single-DM 
and  two-DM  correction  when  the  atmospheric  phase  distortion  is  confined  to  two  thin  turbulent 
layers.  The  phases,  <j>i  (rad),  in  the  figure  represent  the  contribution  to  the  phase  distortion  by 
the  indicated  point  in  the  turbulent  layer.  In  Figure  1.1(a),  a  ray  from  object  point  1  to  the 
indicated  point  in  the  aperture  plane  undergoes  a  cumulative  phase  distortion  of  <f>i  +  <j>3  rad.  If 
this  object  point  is  used  as  the  reference,  a  phase  compensation  <j)c  =  —<i>\  —  <i>3  rad  is  applied 
in  the  aperture  plane.  (Usually,  the  DM  is  optically  conjugated  to  the  aperture  plane,  so  we 
can  imagine  the  phase  correction  taking  place  in  this  plane.)  A  ray  from  object  point  2  to  the 
same  point  in  the  aperture  plane  has  accumulated  a  phase  deviation  of  <j)2  +  <f>4  rad.  Depending 
on  the  spatial  correlation  of  the  layers,  the  phase  compensation  <j>c,  appropriate  for  object  point 
1,  may  be  of  little  benefit  in  compensating  a  wavefront  propagating  from  object  point  2.  This 
decorrelation  of  the  phase  correction  with  offset  angle  is  known  as  angular  anisoplanatism.  Even 
for  a  “perfect”  adaptive  optical  telescope,  having  no  measurement  noise  or  DM  figuring  error, 
angular  anisoplanatism  presents  a  fundamental  limit  to  performance. 

Now  consider  Figure  1.1(b).  Instead  of  a  single  phase  correction  in  the  aperture  plane,  this 
figure  shows  two  separate  phase  corrections  made  in  the  conjugate  planes  of  the  turbulent  layers. 
The  surface  of  DM  1  is  figured  to  negate  the  phase  aberration  caused  by  turbulent  layer  1,  while 
DM  2  compensates  for  turbulent  layer  2.  In  this  case,  the  rays  from  both  object  points  receive 
the  correct  phase  compensation.  Assuming  the  availability  of  ideal  adaptive  optical  components, 
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angular  anisoplanatism  could  be  completely  eliminated  when  the  number  of  thin  turbulent  layers 
equals  the  number  of  DMs. 

VVe  have  seen  in  the  preceding  example  that  MCAO  can  eliminate  angular  anisoplanatism 
when  optical  turbulence  can  be  accurately  modeled  by  a  small  number  of  thin  phase-changing 
screens.  Ultimately,  we  would  like  to  know  if  MCAO  can  significantly  widen  the  corrected  FOV  of 
an  adaptive  optical  telescope  over  a  variety  of  realistic  turbulence  conditions.  Before  this  question 
can  be  answered,  however,  we  need  to  determine  a  means  of  estimating  the  individual  contribu¬ 
tions  of  different  layers  of  the  atmosphere  to  the  total  wavefront  aberration  in  the  aperture  of  the 
telescope.  An  estimate  of  the  contribution  of  the  individual  layers  is  required  in  order  to  properly 
figure  the  conjugated  DMs.  Dicke  first  suggested  that  high  and  low  altitude  turbulence  effects  could 
be  distinguished  using  the  difference  in  the  wind  speed  at  the  two  altitudes.  By  comparing  phase 
distortion  measurements  made  approximately  0.01  sec  apart,  he  presumed  that  the  low  and  high 
altitude  phase  distortion  contributions  could  be  separated  (11).  It  is  not  known  if  Dicke ’s  scheme 
was  ever  implemented,  but  his  concept  of  processing  multiple  linearly  independent  wavefront  mea¬ 
surements  to  estimate  the  phase  contributions  of  different  parts  of  the  atmosphere  is  the  key  to  the 
approach  presented  in  this  dissertation. 

Fried  suggested  obtaining  the  multiple  wavefront  measurements  by  projecting  through  the 
turbulence  region  at  a  variety  of  angles  using  multiple  reference  sources  (19).  The  measurements 
are  linearly  independent  in  this  case  because  they  are  separated  spatially,  rather  than  temporally 
as  in  Dicke’s  proposal.  To  determine  what  phase  function  to  put  on  the  surface  of  the  DMs,  Fried 
suggested  interpreting  the  measurements  as  arising  from  propagation  through  a  series  of  phase 
screens,  the  number  of  phase  screens  equal  to  the  number  of  DMs.  Even  if  the  turbulence  does  not 
have  a  layered  structure,  one  could  make  the  best  fit  of  a  layered  structure  to  the  measured  data. 
Although  angular  anisoplanatism  may  not  be  completely  eliminated,  it  is  hoped  that  it  could  be 
substantially  mitigated. 
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The  advent  of  the  laser  guide  star,  or  synthetic  beacon,  has  raised  the  practical  possibility 
of  multiple  linearly  independent  wavefront  measurements  in  astronomical  applications  of  adaptive 
optics  (15,  20,  35,  50,  58,  65).  Artificial  reference  sources  are  necessary  because  there  are  too 
few  natural  stars  bright  enough  to  use  as  reference  sources  for  the  wavefront  sensor  (WFS)  mea¬ 
surements.  Beckers  (5)  and  Tallon  and  Foy  (57)  have  proposed  ways  to  combine  the  wavefront 
measurements  from  each  reference  source  to  obtain  the  correct  phase  perturbation  values  to  asso¬ 
ciate  with  each  phase  screen  in  the  propagation  path.  Beckers’  idea  involves  spatially  shifting  each 
measurement  to  “line  up”  the  contribution  of  a  particular  phase  screen  across  the  ensemble.  The 
contributions  of  the  other  phase  screens  will  be  misaligned.  An  average  of  the  measurements  would 
then  presumably  be  weighted  toward  the  contribution  of  the  desired  phase  screen,  while  the  other 
contributions  will  tend  to  average  out.  Tallon  and  Foy  suggest  modeling  each  phase  screen  as  a 
mosaic  of  coherence  areas.  Two  rays  passing  through  the  same  coherence  area  are  considered  to 
have  experienced  the  same  phase  perturbation.  A  constellation  of  artificial  guide  stars  is  arranged 
so  that  all  the  coherence  areas  in  all  the  phase  screens  are  probed.  To  obtain  the  phase  map  asso¬ 
ciated  with  each  phase  screen,  Tallon  and  Foy  suggest  constructing  and  solving  a  system  of  linear 
equations,  with  the  phase  perturbations  associated  with  each  coherence  area  as  the  unknowns  and 
the  Hartmann  WFS  measurements  as  the  known  quantities.  A  detailed  analysis  of  either  of  these 
proposed  methods  for  estimating  the  phase  screen  functions  has  not  been  published. 

In  this  dissertation,  two  novel  methods  of  obtaining  the  phase  screen  contributions  from  WFS 
measurements  are  proposed.  The  first  method  involves  transforming  the  phase  functions  into  the 
spatial  frequency  domain.  ,The  transforms  of  the  WFS  measurements  are  linearly  combined  to 
obtain  estimates  of  the  transforms  of  the  phase  screen  contributions.  In  computing  the  linear 
combination  of  measurements,  an  optimum  set  of  complex  weights  is  derived  in  order  to  minimize 
the  mean-squared  error  in  the  estimated  phase  screen  contribution.  Since  complex  weights  in  the 
spatial  frequency  domain  correspond  to  amplitude  weighting  and  shifting  in  the  spatial  domain. 
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this  estimation  approach  is  similar  to,  but  more  general  than  that  proposed  by  Beckers.  Details  of 
this  frequency  domain  approach  are  the  subject  of  Chapter  III. 

The  second  approach,  detailed  in  Chapter  IV,  is  to  work  in  the  spatial  domain  and  estimate 
the  phase  screen  contributions  as  a  linear  combination  of  wavefront  slope  measurements.  This 
approach  is  an  extension  of  the  method  presented  by  Wallner  in  (62).  Whereas  Wallner  only  used  a 
single  guide  star  to  estimate  the  total  wavefront  phase,  this  dissertation  describes  for  the  first  time 
how  to  combine  multiple  wavefront  measurements  to  estimate  the  contribution  of  different  parts  of 
the  atmosphere  to  the  total  wavefront  phase.  Both  approaches  assume  a  prion  knowledge  of  the 
atmospheric  turbulence  conditions,  which  must  be  obtained  from  a  remote  sensing  instrument. 

Before  construction  of  an  MCAO  system  can  be  considered,  a  theoretical  analysis  of  the 
concept  is  in  order.  Such  an  analysis  is  necessary  to  answer  several  questions  concerning  practical 
issues.  For  example,  how  accurately  can  we  estimate  each  phase  perturbation  function?  How 
much  improvement  in  the  corrected  FOV  may  be  expected  for  a  particular  atmospheric  turbulence 
structure,  given  a  specified  number  of  DMs?  What  is  the  effect  of  WFS  noise  and  DM  figuring 
error  on  such  a  system?  The  purpose  of  this  dissertation  is  to  present  analysis  methods  which  can 
be  used  to  study  these  questions. 

1.4  Assumptions 

Three  major  assumptions  are  made  in  this  dissertation:  1)  geometrical  optics  is  adequate  to 
describe  propagation  down  through  the  atmosphere  at  a  good  observatory  site,  2)  the  atmospheric 
turbulence  structure  at  such  a  site  may  be  modeled  as  a  series  of  phase-changing  screens,  and  3) 
multiple  reference  sources,  less  than  one  arcminute  apart,  are  available  for  WFS  measurements.  By 
assuming  geometrical  optics  propagation,  any  refraction  or  diffraction  of  optical  waves  traversing 
the  atmosphere  is  ignored.  In  other  words,  the  only  atmospheric  effect  considered  is  phase  delay 
due  to  optical  path  length  changes  caused  by  variations  in  the  index  of  refraction.  The  validity 
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of  this  assumption  can  only  be  established  experimentally — doing  so  is  beyond  the  scope  of  this 
dissertation.  Regardless  of  the  results  of  this  research,  successful  practical  implementation  of  MCAC 
depends  on  how  closely  the  near  field  criterion  approximates  reality.  Most  researchers  consider  this 
assumption  to  be  valid  for  a  good  observatory  site  (5,  26,  63).  A  theoretical  argument  in  support 
of  the  validity  of  this  assumption  is  presented  in  Section  2.3.5. 

Several  factors  motivated  the  use  of  a  layered  atmosphere  model  for  an  initial  analysis  of 
MCAO.  One  reason  is  that  experimental  evidence  indicates  that  the  optical  turbulence  in  the 
atmosphere  is  often  concentrated  in  a  small  number  of  layers  (52,  61).  However,  we  could  approx¬ 
imate  any  turbulence  distribution  by  using  a  sufficient  number  of  layers.  Since  an  MCAO  system 
approximates  a  layered  atmospheric  structure,  the  use  of  such  a  model  is  convenient  for  analysis 
purposes.  In  addition,  it  allows  the  mathematical  results  to  be  expressed  in  terms  of  the  phase 
statistics  instead  of  refractive  index  statistics. 

Earlier  it  was  pointed  out  that,  due  to  the  lack  of  adequate  natural  stars,  the  multiple 
WFS  reference  sources  in  an  MCAO  system  must  be  artificially  generated  by  laser  backscatter. 
Practical  issues  not  addressed  in  this  dissertation  include  accurate  calibration  of  the  synthetic 
beacon  projection  system  and  associating  wavefront  measurements  with  the  correct  reference  source. 
It  is  assumed  that  the  beacons  can  be  projected  with  offset  angles  on  the  order  of  a  few  arcseconds 
and  that  all  phase  perturbation  measurements  can  be  made  within  the  atmosphere’s  correlation 
time  (27)  so  that  time  multiplexing  can  be  used  to  associate  the  measurements  and  reference 
stars.  This  requirement  clearly  limits  the  number  of  measurements,  but  researchers  have  already 
demonstrated  adaptive  optics  using  sequential  wavefront  measurements  from  two  artificial  guide 
stars  (47). 
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1.5  Scope 


The  analysis  in  this  dissertation  involves  only  spatial,  not  temporal,  quantities  and  the  scope 
of  the  analysis  is  limited  to  open-loop  operation  (i.e.,  the  system  measures  and  reconstructs  the 
incoming  wavefronts  individually  vs.  nulling  the  residual  phase  error  via  a  feedback  loop).  Optical 
design  issues  (materials,  losses,  etc.)  are  not  considered.  This  dissertation  concentrates  on  studying 
signal  processing  techniques  as  opposed  to  the  design  of  wavefront  sensing  devices  and  methods. 

1.6  Significant  Results 

Two  novel  methods  for  obtaining  the  correct  DM  surface  functions  are  developed.  Addition¬ 
ally,  thorough  analysis  demonstrates  for  the  first  time  that  MCAO  is  a  promising  technology  for 
increasing  the  compensated  FOV  of  an  adaptive  optical  telescope.  However,  the  inability  of  artificial 
guide  star  systems  to  sense  overall  tilt  limits  the  theoretical  performance  of  MCAO  systems. 

1.7  Summary 

In  this  chapter  we  have  described  the  dissertation  problem,  which  is  to  analyze  a  method  for 
increasing  the  corrected  FOV  of  an  adaptive  optical  telescope.  The  approach  is  to  add  additional 
wavefront  correcting  elements  (DMs)  in  the  optical  train  of  the  telescope.  To  obtain  the  commands 
for  multiple  DMs,  multiple  linearly  independent  measurements  of  the  atmospheric  phase  distortion 
are  needed.  The  main  work  of  this  dissertation  involves  proposing  and  analyzing  methods  for 
obtaining  the  DM  commands  from  the  measurements.  Before  describing  this  analysis  in  chapters  III 
and  IV,  the  next  chapter  presents  some  background  material  on  atmospheric  and  adaptive  optics. 
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II.  Background 


2.1  Introduction 

It  is  well  known  that  the  atmosphere  degrades  the  performance  of  optical  imaging  systems 
such  as  ground-based  astronomical  telescopes.  This  atmospheric  distortion  is  the  result  of  inhomo¬ 
geneities  in  the  index  of  refraction,  n,  of  the  air,  randomized  by  the  turbulent  mixing  which  results 
from  solar  heating.  During  the  past  several  decades,  great  progress  has  been  made  in  describing 
and  measuring  the  effects  of  turbulence  on  light  passing  through  the  atmosphere  and  in  devising 
methods  of  correcting  for  these  effects  (29,  31,  43,  60).  Such  image  compensation  methods  fall  into 
two  categories;  predetection  and  post-detection  compensation.  Post-detection  compensation  en¬ 
tails  processing  already  blurred  images  to  reduce  the  degradation  due  to  the  interfering  atmosphere. 
This  post-processing  usually  involves  multiple  short-exposure  images,  that  is,  images  obtained  using 
an  integration  time  sufficiently  short  to  “freeze”  a  particular  realization  of  the  random  atmospheric 
turbulence  structure.  Predetection  compensation,  also  known  as  adaptive  optics,  involves  nulling 
the  phase  component  of  the  distortion  in  real-time  using  a  servo  loop.  This  dissertation  is  concerned 
with  predetection  compensation  systems. 

In  chapters  III  and  IV,  the  performance  of  adaptive  optics  systems  will  be  evaluated  in  terms 
of  the  variance  of  the  phase  aberration  function  in  the  pupil  plane  of  the  telescope.  In  the  context 
of  adaptive  optics  system  analysis,  the  error  that  remains  after  compensation  is  of  interest.  Thus 
this  criterion  is  often  referred  to  as  “mean-square  residual  phase  error.”  The  reason  this  criterion 
is  chosen  is  because  it  is  directly  related  to  optical  system  performance.  Specifically,  as  shown  in 
Section  2.2,  the  phase  aberration  function  determines  the  system  optical  transfer  function  (OTF), 
point  spread  function,  and  Strehl  ratio.  Other  concepts  underlying  the  analysis  in  chapters  III 
and  IV  include  the  refractive  index  structure  constant,  C^,  phase  structure  function,  2)^,  and  the 
atmospheric  coherence  diameter,  tq.  Section  2.3  defines  these  terms  and  relates  them  to  each  other. 
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Finally  sections  2.4  and  2.5  provide  background  on  adaptive  optical  telescopes  and  laser  guide  stars, 
respectively. 

2  '  Incoherent  Optical  Systems  Analysis 

2.2.1  Optical  Transfer  Function.  Consider  Goodman’s  generalized  model  of  an  imaging  sys¬ 
tem  shown  in  Figure  2.1.  The  object  being  imaged  lies  in  the  XoVo  plane.  The  coordinates  of  the 
object  plane  are  normalized  by  the  magnification  of  the  imaging  system  such  that  if  the  physical 
coordinates  of  a  point  on  the  object  are  (A'o,  To),  then  its  coordinates  in  the  XoVo  plane  are  given  by 
xo  =  MA'0,2/0  =  A/To,  where  M  is  the  system  magnification.  Note  that  M  can  be  either  negative 
or  positive;  negative  M  indicating  that  the  image  is  inverted.  With  this  normalization,  an  object 
point  and  its  corresponding  point  in  the  image  (xiy,)  plane  will  have  the  same  coordinate  values. 


Entrance  Exit 


Figure  2.1.  Generalized  model  of  an  imaging  system  (25). 

The  imaging  system  may  be  mathematically  modeled  by  a  transmission  function  (7),  also 
known  as  the  impulse  response  function  (37),  represented  by  h(xo,  Po- a*i,  yj;  A)  with  A  representing 
wavelength  and  bold  type  indicating  a  complex  quantity.  Born  an<I  Wolf  define  h  as  the  normalized 
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complex  amplitude  at  image  point  (xj,  j/;),  due  to  a  unit  amplitude,  zero  phcuse  disturbance  at  obje^'t 
point  (Xo,  f/o)  (^)-  In  many  ceises,  optical  systems  analysis  is  performed  using  the  form  of  h  derived 
for  monochromatic  light,  therefore  the  wavelength  dependence  will  be  dropped  from  this  point  on. 

From  the  Huygens- Fresnel  principle  and  electromagnetic  theory,  it  is  known  that  the  propa¬ 
gation  of  light  is  a  linear  phenomenon.  Hence,  if  Uo  is  the  complex  amplitude  of  the  optical  field 
emitted  from  the  object,  the  complex  amplitude  of  the  optical  field  in  the  image  plane  can  be  found 
by  means  of  a  superposition  integral  (7): 


Ui(xi,t)  =  j  d-XoUo(XoJ)h(Xo,Xi) 


(2.1) 


where  t  indicates  a  time  varying  object  and  image  field  and  x  —  (x,  y)  is  a  twc-dimcnsional  position 
vector.  The  limits  of  integration  are  formally  infinite  but  effectively  defined  by  h.  For  the  finite 
region  of  the  object  plane  known  as  the  isoplanatic  region,  the  transmission  function  of  a  good 
optical  system  will  be  shift  invariant.  Hence,  for  objects  smaller  than  the  isoplanatic  region, 
equation  (2.1)  can  be  written  as  a  convolution  integral  (7); 


Ui(xi,/)  =  j  d^XoUoiXo,f)h{Xi-  Xo) 


(2.2) 


A  practical  imaging  system  records  the  image  intensity,  /j,  which  is  defined  as  the  time  average  of 
the  magnitude  squared  of  the  complex  field  amplitude.  Using  angle  brackets  to  denote  averaging 
in  time  and  z*  for  the  complex  conjugate  of  z,  the  image  intensity  can  be  written  as  (37) 


h{^i)  =  J  J 


(uo(xo,<)u;(xJ„<))h(ri  -  Xo)h*(xi  -  x'J 


(2.3) 


2-3 


For  the  Ccise  of  an  object  radiating  incoherent  light,  the  image  intensity  can  be  easily  related  to  tlie 
object  intensity.  For  the  incoherent  imaging  case,  the  object  mutual  intensity  is  given  by  (37) 

(Uo(fo,<)Uo(-i^o.O)  =  |Uo(?o)P'5(  Co  -  •r'o)  =  io{Xo)Sixo  -  x'j  (2.4) 

Substituting  equation  (2.4)  into  equation  (2.3)  and  using  the  sifting  property  of  the  Dirac  delta 
function,  6(x  ),  it  is  found  that  the  image  intensity  for  an  optical  system  in  the  presence  of  incoherent 
light  is  described  by  the  following  expression,  in  which  the  operator  *  stands  for  convolution  (37): 

A(^i)  = /o(^i)  *  N^i)P  (2.5) 


The  Fourier  transform  of  the  preceding  relationship  may  be  used  to  analyze  the  optical  system 
in  the  spatial  frequency  domain.  Typically,  the  Fourier  transforms  of  each  of  the  quantities  in 
equation  (2.5)  are  normalized  by  their  zero-frequency  values.  Using  script  letters  to  indicate  these 
normalized  Fourier  transforms,  the  frequency  domain  representation  of  incoherent  imaging  is  (37) 

(2.6) 


where  /  =  (/x,  /y)  is  a  spatial  frequency  vector.  H  in  equation  (2.6)  is  the  optical  transfer  function, 
given  by  (37) 


nif)  = 


:r{|h(fi)P}/=o 


(2.7) 


where  X{  }  represents  the  Fourier  transform. 


The  form  of  the  optical  transfer  function  can  be  derived  if  a  point  source  object  is  used  in 
equation  (2.2).  For  a  point  source  object,  the  resulting  image  will  be  the  system  impulse  response. 
Referring  again  to  the  generalized  imaging  system  model  of  Figure  2.1,  the  usual  procedure  is  to 
assume  that  geometrical  optics  applies  to  the  contents  of  the  “black  box.”  Under  this  assumption. 
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diffraction  effects  are  entirely  due  to  the  entrance  or  exit  pupil,  whichever  is  smaller.  Here  it  is 
assumed  that  the  exit  pupil  is  the  smaller  aperture,  described  by  the  pupil  function  Wa(x).  If  the 
exit  pupil  is  preceded  by  a  converging  lens,  and  the  distance  d\  to  the  image  plane  satisfies  the  lens 
law,  point  source  illumination  will  produce  an  image  which  is  the  Fraunhofer  diffraction  pattern  of 
the  exit  pupil.  This  image  is  the  impulse  response  function  (25); 

h(ii-£o)  =  y  Wa(?)exp  -  Xo)  •  x]|  (2.8) 


In  the  preceding  equation,  a  constant  multiplicative  factor,  which  is  of  no  consequence  due  to 
the  normalized  form  of  the  OTF  definition,  has  been  neglected.  With  the  following  change  of 
variables  (25); 


/ 


(2.9) 


in  equation  (2.8),  the  impulse  response  function  becomes,  apart  from  constants  (25); 


h(xi  -  Xo)  =  JJ  Wa(Ac/i/)exp  ^-j2ir[{xi  -  Xo)  •  /]|  (2.10) 


Thus,  the  Fourier  transform  of  h  is  Wa(Adi/).  Applying  the  Fourier  autocorrelation  theorem  to 
equation  (2.7)  leads  to  the  well-known  formula  for  the  optical  transfer  function  of  a  diffraction- 
limited  imaging  system  (26); 


-Hif)  = 


Wa(Adi/)*W;(Acli/) 

J  d2x|Wa(x)P 


(2.11) 


where  the  operator  ★  indicates  correlation.  This  formula  can  also  be  used  for  systems  that  are 
not  diffraction-limited  by  modifying  the  pupil  function  to  account  for  aberrations  in  the  optics  or 
inhomogeneities  in  the  intervening  medium.  For  example,  to  obtain  an  OTF  for  a  system  consisting 
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of  both  atmosphere  and  optics,  it  is  necessary  to  start  with  a  new  pupil  function  (26), 


Wi(x)  =  Wa(f)t(x) 


(2.12) 


where  the  transmittance  function  t(x )  accounts  for  the  effects  of  the  atmosphere.  For  astronomical 
imaging  at  a  good  observatory  site,  it  may  be  assumed  that  the  turbulence  is  close  to  the  aperture, 
relative  to  the  object  distance,  and  sufficiently  weak  that  diffraction  and  refraction  may  be  ignored. 
Because  of  this  assumption,  phase  delay  is  considered  the  only  atmospheric  effect  of  consequence, 
and  the  transmittance  function  is  chosen  to  model  phase  disturbances  only.  Hence,  t(x)  can  be 
written  (26) 

t(f)  =  exp[jV’{^)]  (2.13) 


where  r/>  is  the  random  phase  perturbation. 

Substituting  equations  (2.12)  and  (2.13)  into  the  OTF  formula  leads  to  an  expression  for  the 
instantaneous  OTF  of  the  turbulence-degraded  system; 


«(/)  = 


J I  d2£Wa(x)W;(x  -  M/)exp[;(V-i  -  V-2)] 


J I  tW,(x)p 


(2.14) 


where 


^>1  =  ^(x) 

rp2  =  (2.15) 


To  get  the  long  exposure  OTF,  the  usual  procedure  is  to  assume  that  ip{x)  is  an  ergodic  random 
process  and  find  the  ensemble  average  OTF,  which  is  defined  as  the  expected  value  of  the  numerator 
of  equation  (2.14)  divided  by  the  expected  value  of  the  denominator  of  equation  (2.14)  (26).  Note 
that  this  definition  is  not  the  same  as  but  it  contains  the  same  information  and  is  easier 
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to  compute.  With  this  definition,  it  is  noted  that  the  average  OTF  of  the  aberrated  system  can  be 
written  as  the  product  of  the  OTF  for  the  diffraction-limited  system  times 

■WlI/)  =  {exp(j(V'i  -  V’2)])  (2.16) 

which  is  called  the  long  exposure  OTF  of  the  atmosphere  (26).  Here  the  angle  brackets  denote 
expectation. 

The  Rytov  solution  for  propagation  of  an  electromagnetic  wave  in  a  medium  of  inhomogeneous 
n  (with  the  variation  contained  in  the  permittivity)  predicts  that  i>(x)  has  Gaussian  statistics. 
Based  on  this  fact,  and  the  fact  that  the  phase  fluctuations  are  zero-mean,  it  can  be  shown  that 

'WL(/)=exp  -^VrpC^dif)  (2.17) 

where  f  =  \f\  and 

2)^(r)  =  +  (2.18) 

is  the  phase  structure  function  discussed  in  Section  2.3.  In  writing  as  a  function  of  r,  isotropic 
turbulence  was  assumed  with  r  =  |r  |. 

As  pointed  out  in  the  introduction  to  this  chapter,  the  performance  of  an  adaptive  optical 
system,  as  measured  by  the  OTF,  is  a  function  of  the  phase  aberration  function,  in  the  pupil 

plane.  The  objective  of  an  adaptive  optical  telescope  is  to  reduce  V'(£)  to  0  at  all  points  in  the 
aperture,  thereby  achieving  the  diffraction  limited  OTF.  Other  measures  of  optical  system  quality, 
the  point  spread  function,  resolution,  and  the  Strehl  ratio,  are  derivable  from  the  OTF  and  thus 
also  functions  of 

2.2.2  Point  Spread  Function.  The  point  spread  function  (PSF)  is  the  inverse  Fourier  trans¬ 
form  of  the  optical  transfer  function.  The  PSF  describes  the  incoherent  image  of  a  point  source. 
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For  a  diffraction-limited  optical  system  with  a  cin  ular  exit  pupil,  the  PSF  is  the  familiar  Airy 
pattern,  [2Ji(r)/r]^,  where  Ji(  )  is  the  Bessel  function  of  the  first  kind,  order  1  (25). 

2.2.3  Resolution.  Optical  system  resolution  may  be  evaluated  in  either  the  spatial  or  the 
spatial  frequency  domain.  In  the  spatial  domain,  the  Rayleigh  criterion  is  often  used.  By  this 
criterion,  if  the  peak  of  the  Airy  pattern  resulting  from  one  object  point  is  located  on  the  first 
zero  of  the  Airy  pattern  resulting  from  another  object  point,  the  two  points  are  considered  “barely 
resolved”  by  the  system  (25).  Another  spatial  domain  parameter  is  the  full  width  at  half  maximum 
(FWHM)  of  the  point  spread  function.  Frequency  domain  measures  of  resolution  include  the  cutoff 
frequency  of  the  optical  transfer  function  as  well  as  the  volume  under  the  OTF  (26). 

2.2.4  Sirehl  Ratio.  The  Strehl  ratio  is  defined  as  the  ratio  of  the  peak  of  the  point  spread 
function  of  an  actual  optical  system  to  that  of  a  diffraction-limited  system.  It  is  a  measure  of  the 
additional  spreading  of  the  image  intensity  distribution  due  to  aberrations  in  the  optical  system.  To 
show  how  the  Strehl  ratio  is  related  to  the  wavefront  phase,  Born  and  Wolf’s  derivation  is  presented 
in  this  section,  with  the  aid  of  Hardy  (7,  29). 

Figure  2.2  depicts  a  perfectly  spherical  wave  converging  to  an  image  point  P.  A  diffraction- 
limited  system  wouid  create  such  a  wave  from  a  perfectly  spherical  diverging  wave  that  was  received 
by  its  entrance  pupil.  The  free-space  Green’s  function  describes  this  wave  (25): 

G(fl)=!5E(^  (2.19) 

where  R  is  the  radius  of  the  sphere  and  k  =  2ir/X  is  the  wave  number.  Figure  2.2  also  shows  an 
aberrated  wavefront  which  is  displaced  from  the  reference  sphere  by  an  amount  ip  (rad),  which  is  a 
function  of  position.  If  is  so  small  that  the  amplitude  of  the  aberrated  wave  may  be  considered 
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Aberrated 


constant,  an  expression  for  the  field  follows  directly  from  equation  (2.19): 


Gi(fl)  =  A 


exp[-j(kR  -  V>)] 
R 


(2.20) 


where  A  is  the  constant  amplitude  at  unit  distance  from  the  source.  The  Huygens-Fresnel  principle 
states  that  the  field  at  point  P  may  be  considered  as  the  resultant  of  an  infinite  number  of  point 
sources  located  on  the  portion  of  the  aberrated  wavefront  enclosed  by  the  exit  pupil  of  the  optical 
system.  Hence,  the  field  at  P  (the  center  of  the  reference  sphere)  may  be  found  using  the  Rayleigh- 
Sommerfeld  diffraction  formula  (25): 

U(/’)=  exp(jfcfi)  J  ^2,21) 

the  integration  takes  place  over  the  exit  pupil.  From  this  point  on  in  the  dissertation,  the  pupil  will 
be  modeled  by  a  real  aperture  weighting  function  H'a(i )  normalized  for  mathematical  convenience 
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such  that 

J  d^xW^ix)=l  (2.22) 

Note  ill  equation  (2.21)  that  the  diffraction  angles  are  assumed  small  enough  that  the  obliquity 
factor  may  be  neglected.  From  equation  (2.21),  the  intensity  at  point  P  follows  (29): 


I[P) 


d^x  lVa(x  )  exp[ji/)(x  )] 


(2.23) 


The  intensity  of  an  unaberrated  system,  I\(P),  may  be  found  by  setting  ti’(x  )  =  0  (29): 


h(P)  = 


(2.24) 


Earlier  in  this  section,  the  Strehl  ratio  was  defined  as  the  ratio  of  the  intensity  of  the  aberrated 
system  to  that  of  the  diffraction-limited  system. 


s[V'(i )] 


i(P) 

h(P) 


1/ 


d^xWt,{x)expljtp{x)] 


(2.25) 


Using  the  power  series  expansion  for  exp[j^(>(x  )],  the  above  expression  can  be  rewritten  (7): 


5[V'(x)]=  J  d^xW^{x) 


l+jip{x)-  -V’^(x)  +  ... 


(2.26) 


For  small  t^(x )  (7), 


S[ip{x)] 


(2.27) 


where  the  overbar  indicates  the  averages  of  tl^{x)  and  4'^(x)  over  the  exit  pupil  (7): 


/ 


d^xw^{x)r{s) 


(2.28) 
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Expanding  equation  (2.27),  and  continuing  to  drop  terms  of  higher  than  quadratic  order,  the  Strehl 
ratio  approximation  becomes  (7) 

S[V'(i)]%l-  (2.29) 

Defining  c~  as  the  aperture  average  variance  of  the  phase  aberration  function: 

=  J  (fW^(x  ) 

=  (2.30) 

the  Strehl  ratio  can  be  written 

S(c2)  =  l-f^  (2.31) 

A  second  application  of  the  series  expansion  for  exp(a:)  yields  the  further  approximation  for  small 

€  (22): 

S(e2)«exp(-€^)  (2.32) 

with  P  (rad^)  being  the  total  phase  error  variance  of  the  optical  system.  Tyson  points  out  that 
this  formula  is  valid  for  f  <  1.25  rad  ss  A/5  (59). 

2.3  Atmospheric  Optics 

The  previous  section  showed  that  several  important  measures  of  optical  system  quality  are 
functions  of  the  phase  of  the  wavefront  in  the  pupil  plane.  Consequently,  the  results  of  the  adaptive 
optics  system  analysis  of  chapters  III  and  IV  are  in  terms  of  statistical  averages  of  the  random 
process  ^>(x).  This  section  discusses  the  statistical  properties  of  atmospheric  phase  distortion 
following  a  geometrical  optics  approach. 
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A  wavefront,  a  useful  concept  in  the  discussion  of  atmospheric  optics,  is  defined  as  a  surface 
of  constant  optical  path  length  orthogonal  to  a  collection  of  rays  (29).  The  light  emanating  from 
a  source  located  in  a  medium  of  constant  n  propagates  with  spherical  wavefronts.  At  a  sufficient 
distance  from  the  source,  the  wavefront  may  be  considered  planar. 

The  index  of  refraction  of  the  atmosphere  is  a  random  function  of  position,  as  well  as  time 
and  wavelength.  Since,  in  a  geometrical  optics  framework,  optical  path  length  between  two  points 
A  and  B  is  equal  to  the  path  integral  of  n  (7), 

(O.P.L.)  =  /;  dsn{s)  (2.33) 

it  is  found  that  initially  planar  wavefronts  become  distorted  as  they  pass  through  the  atmosphere. 
This  distortion  is  described  by  a  two-dimensional  function  of  position,  which  goes  by  various 

names  in  the  literature  such  as  wavefront  aberration  function  (4)  or  waveheight  function  (45). 
Following  most  authors,  this  dissertation  will  frequently  use  the  term  “phase”  as  a  shorthand 
nomenclature  for  since  it  is  understood  that  an  absolute  phase  measurement  is  never  implied. 
The  following  sections  present  the  statistical  quantities  used  to  characterize  optical  turbulence  and 
aberrated  wavefronts. 

2.3.1  Refractive  Index  Power  Spectrum.  The  refractive  index  power  spectrum,  ^n(«),  is 
a  measure  of  the  strength  of  the  optical  turbulence  at  a  point  in  space  as  a  function  of  spatial 
frequency.  The  argument  k  =  (27r//,,27r//y,2ir//i)  is  a  wave  number  vector  indicating  the  scale 
size  of  pockets  of  air,  with  dimensions  /,  x  /y  x  /^,  inside  which  n  is  considered  to  be  approximately 
constant.  These  pockets  of  air  are  called  eddies;  large  eddies  correspond  to  small  values  of  k  while 
small  scale  turbulence  implies  large  k.  may  be  interpreted  as  the  relative  abundance  of  different 
size  eddies  (26). 
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The  formation  of  these  eddies  begins  with  uneven  heating  of  large  volumes  of  air  due  to 
the  position  of  the  sun  and  different  types  of  underlying  terrain.  These  large  volumes  of  air  are 
broken  up  into  eddies  of  various  sizes  by  mechanical  turbulence.  The  resulting  random  distribution 
of  pockets  of  air  having  slightly  different  temperatures  is  the  underlying  cause  of  atmospheric 
distortion.  The  recison  for  this  phenomenon  is  that  the  index  of  refraction,  while  a  function  of 
temperature,  pressure,  humidity,  and  wavelength,  is  more  sensitive  to  variations  in  temperature 
than  to  changes  in  the  other  parameters  (33).  When  the  term  turbulence  is  used  in  the  context  of 
optical  imaging,  it  is  the  three-dimensional  random  process  of  temperature  (or  index  of  refraction) 
that  is  being  referred  to.  Mechanical  turbulence  itself  actually  has  relatively  little  effect  on  imaging. 

The  most  commonly  used  form  of  based  on  the  work  of  Kolmogorov  (42),  is  given  by  (26): 

<^„(/c)  =  0.033C>-“/^  (2.34) 

where  is  the  structure  constant  discussed  in  the  next  section  and  isotropic  turbulence  is  assumed 
(k  =  |k|).  This  expression  is  valid  for  an  intermediate  range  of  scale  sizes  called  the  inertial 
subrange;  it  does  not  describe  the  refractive  index  structure  that  results  from  very  large  or  very 
small  inhomogeneities  because  the  physical  laws  used  to  derive  equation  (2.34)  do  not  govern  the 
formation  of  eddies  of  these  sizes.  The  inertial  subrange  is  the  region  in  which  energy  is  transferred 
from  large  to  small  eddies  without  appreciable  loss.  The  boundaries  for  which  the  Kolmogorov 
spectrum  is  valid  are  known  as  the  inner  scale,  on  the  order  of  a  few  millimeters,  and  the  outer 
scale,  which  may  be  as  large  as  100  meters  (26).  This  range  of  validity  is  not  of  great  concern,  since 
scale  sizes  larger  than  the  optical  aperture  will  cause  a  uniform,  as  opposed  to  randomly  varying, 
optical  path  length  change  across  the  area  of  the  wavefront  intercepted  by  the  aperture.  A  uniform 
value  of  phase  across  the  aperture  has  no  effect  on  the  imaging  system,  since  it  does  not  represent  a 
deformation  of  the  received  wavefront.  As  for  the  inner  scale  of  the  turbulence,  even  if  the  ability  to 
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correct  for  the  very  high  spatial  frequency,  but  low  power,  deformations  that  result  from  microscale 
turbulence  were  available,  it  would  yield  a  virtually  indiscernible  improvement  in  image  quality. 


2.3.2  Refractive  Index  Structure  Constant.  The  term  “refractive  index  structure  constant” 
can  lead  to  confusion,  since  is  in  fact  a  function  of  geographical  location  and  altitude.  An  indi¬ 
cation  of  the  strength  of  the  optical  turbulence,  the  refractive  index  structure  constant  is  measured 
in  units  of  meters"^^^  (26).  Experimental  data  reveals  that  the  value  of  is  highly  variable  with 
altitude.  For  this  reason,  combined  with  its  dependence  on  geographical  location,  the  structure 
constant  is  difficult  to  characterize.  For  analytical  purposes,  however,  is  often  set  to  a  uniform 
value  over  a  finite  altitude  range. 

Hufnagel  presents  two  additional  profiles  to  aid  in  propagation  calculations  in  cases  where 
a  uniform  model  would  be  too  simplistic  (33).  A  simple  model,  in  terms  of  altitude  h  in  meters 
above  local  ground,  is 


/• 


=  i 


1.5  X  10-‘3 

- 7i - 


0 


below  20,000  m  above  sea  level 
above  20,000  m  above  sea  level 


(2.35) 


In  the  preceding  profile,  local  ground  is  assumed  to  be  <  2500  m  above  sea  level.  Equation  (2.35) 
models  one  of  the  notable  features  of  empirical  data,  which  show  that  the  turbulence  is  strongest 
near  the  ground  and  generally  decreases  with  altitude.  A  more  complicated  formula,  known  as  the 
Hufnagel  model,  captures  a  second  characteristic — a  peak  in  the  turbulence  strength  in  the 

neighborhood  of  the  tropopause: 


C^(z)  =  8.2  X  10-®Vz'°exp(-2/1000)  +  2.7  x  10-*®exp(-2/1500)  (2.36) 


where  v  (m/sec)  is -the  rms  wind  speed  averaged  between  5000  and  20,000  meters  and  z  (m)  is  the 
altitude  above  sea  level.  Strictly  speaking,  the  Hufnagel  model  is  only  valid  above  the  first  strong 
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inversion  layer  (33),  however  it  is  often  used  as  a  sea  level  to  20,000  meter  profile  as  shown  in  the 
plot  in  Figure  2.3. 
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Figure  2.3.  Hufnagel  model  for  u  =  27  m/sec. 

2.3.3  Phase  Sinicture  Function.  According  to  equation  (2.34),  the  functional  form  of  the 
Kolmogorov  refractive  index  power  spectrum  has  a  nonintegrable  pole  at  the  origin,  implying  an 
infinite  variance.  Hence,  a  corresponding  autocorrelation  function  cannot  be  defined.  For  this 
reewon,  structure  functions  are  often  used  to  describe  statistical  quantities  in  atmospheric  optics. 
The  structure  function  of  a  general  random  process  Z{f)  is  defined  as  (26) 

2’z(ri,  r-2)  =  ([Z(ri )  -  Z(f2  )]=*)  (2.37) 

For  homogeneous  and  isotropic  Kolmogorov  turbulence  and  vertical  viewing,  the  refractive  index 
structure  function,  2>„,  is  found  to  be  (26) 

^’r.(r)  =  Cyf^  (2.38) 
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Note  that  Vn  is  finite  for  path  length  separations  near  zero. 

In  this  dissertation,  the  phase  structure  function,  is  used  in  tlie  analysis  of  Chapter  1\'. 
The  derivation  of  the  phase  structure  function  is  mathematically  complex  (26).  The  procedure, 
shown  in  Figure  2.4,  is  to  calculate  the  optical  path  length  for  two  parallel  rays  traversing  the 
atmosphere  and  separated  by  a  distance  r.  The  formula  (2.33)  is  used  to  obtain  expressions  for  the 


•«e -  ^  ^ 


Figure  2.4.  Procedure  used  to  calculate  phase  structure  function  (26) 


two  phases,  i>i  and  V’z.  in  terms  of  the  random  index  of  refraction.  Using  these  results,  can  be 
found  in  terms  of  X>„,  which  in  turn  is  obtainable  from  For  isotropic  Kolmogorov  turbulence, 
the  final  result  is  (26) 

V^{r)  =  2.9lPr^f^  f  (2  39) 

Jo 

where  k  is  the  average  wave  number  of  the  light  and  z  is  the  thickness  of  the  turbulence  region. 
The  above  result  is  valid  for  path  separations  r  within  the  inertial  subrange,  a  restriction  that 
is  usually  of  no  consequence  for  the  reasons  mentioned  at  the  end  of  Section  2.3.1.  Also  notice 
that  equation  (2.39)  shows  that  it  is  the  integrated  turbulence  strength  which  is  of  importance  for 
propagation  calculations. 
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By  substituting  equation  (2.39)  into  equation  (2.17),  the  formula  for  the  average  long  exposure 
OTF  of  the  atmosphere  for  Kolmogorov  turbulence  is  obtained  (26): 


■Wbi/)  =  exp 


-57.4(di/)^/3^-‘/3y'd^C^(0 


(2.40) 


2.3.4  Atmospheric  Coherence  Diameter.  As  illustrated  in  Figuie  2.5,  optical  rays  separated 
laterally  by  more  than  a  few  centimeters  cross  different  collections  of  turbulent  eddies,  resulting 
in  deformation  in  the  wavefront.  As  might  be  expected,  the  turbulence  scale  size  results  in  the 
wavefront  phcise  at  two  points  being  correlated  to  some  degree  if  the  points  are  close  together  and 
uncorrelated  if  the  points  are  widely  separated.  Fried  derived  a  mecisure  of  the  area  over  which 
the  phase  is  nominally  correlated  (17,  18).  This  quantity,  designated  r-Q,  is  known  as  the  Fried 
parameter  or  atmospheric  coherence  diameter. 

The  analysis  which  leads  to  the  definition  of  tq  begins  with  the  calculation  of  the  long  exposure 
transfer  function  of  a  circular  aperture  imaging  through  turbulence.  The  diffraction-limited  OTF 
of  a  circular  aperture  of  diameter  D  is  given  by  (18): 


Wo(/) 


cos-\Xd-JID)  -  {\dJ/D)s/\  -  (\d-JlD)A  XdJ  <  D 


0 


Adi/  >  D 


(2.41) 


By  multiplying  transfer  function  (2.4U)  by  (2.41),  the  long  exposure  OTF  of  a  circular  aperture 
imaging  through  Kolmogorov  turbulence  is  obtained.  One  measure  of  resolution  for  the  turbulence- 
degraded  system  is  the  volume  under  the  OTF.  (By  Fourier  theory,  it  is  also  equal  to  the  peak 
of  the  PSF,  which  is  equivalent  to  a  Strehl  ratio  measure.)  Fried  found  that  by  taking  the  limit 
of  the  integrated  OTF  as  D  goes  to  infinity,  this  resolution  approached  a  majcimum  value.  Fried 
then  defined  ro  as  the  diameter  of  an  equivalent  diffraction-limited  telescope  with  this  same  value 
of  resolution.  It  is  a  function  of  wavelength  and  the  optical  turbulence  strength  integrated  over  the 
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Figure  2.5.  Optical  paths  separated  by  an  amount  greater  than  the  scale  size  of  the  turbulence  tra¬ 
verse  different  collections  of  eddies.  Hence,  the  degree  of  correlation  between  the  phase 
change  experienced  by  two  different  rays  depends  on  their  path  length  separation. 
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optical  path  (26): 


ro  =  0.185 


A" 


3/5 


(2.42) 


where  the  overbar  indicates  an  average  value  and  r  is  the  thickness  of  the  turbulence  region.  Thus, 
To  is  often  more  useful  than  or  C'^  in  that,  as  a  single  number,  it  characterizes  atmospheric 
turbulence  conditions.  Fried  has  shown  how  ro  can  be  interpreted  as  a  measure  of  the  limiting 
resolution  of  an  optical  system  imaging  through  a  random  medium  (18).  Speci^cally,  regardless 
of  the  actual  aperture  size,  the  resolution  of  a  telescope  looking  through  the  atmosphere  is  ap¬ 
proximately  that  of  a  diffraction-limited  telescope  of  diameter  ro.  (Astronomers  still  build  large 
telescopes  to  gather  more  light  from  the  object  of  interest,  thereby  increasing  signal  to  noise  ratio.) 
This  limitation  is  quite  severe,  typical  values  of  ro  range  from  less  than  5  cm  for  poor  observing 
or  “seeing”  conditions  to  20  cm  if  the  seeing  is  “exceptionally  good  (26).”  Measurement  of  ro  is  a 
useful  parameter  for  evaluation  of  potential  observatory  sites.  With  the  preceding  definition  of  ro, 
the  formula  for  the  Kolmogorov  phase  structure  function  simplifies; 


V,t,{r)  =  6.8839 


(^) 


5/3 


(2.43) 


2.3.5  Phase  Power  Spectrum.  The  frequency  domain  analogue  to  the  phase  structure  func¬ 
tion  is  the  phase  power  spectrum  (48): 


^Af) 


0.023 

r5/3yn/3 


(2.44) 


This  function  is  plotted  in  Figure  2.6  for  several  values  of  ro.  From  inspection  of  the  figure,  it  is  clear 
that  most  of  the  power  in  the  phase  fluctuations  is  contained  in  the  lowest  spatial  frequencies.  At  a 
good  observatory  site,  one  would  expect  ro  to  be  >  0.05  most  of  the  time.  Even  for  this  “moderately 
poor”  value  of  ro,  there  is  very  little  power  in  the  phase  fluctuations  at  spatial  frequencies  above 
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5  cycles  per  meter.  This  observation  provides  support  for  the  assumption  that  geometrical  optics 
adequately  models  propagation  down  through  the  atmosphere  at  a  good  observatory  site.  It  has 
been  shown  that  Fresnel  propagation  can  be  described  by  a  transfer  function  (25): 

H(/)  =  exp(-j-irXzf-)  (2.45) 

where  k  is  the  wave  number,  z  is  the  propagation  distance,  and  a  constant  phase  delay  term  has  been 
suppressed.  If  the  argument  of  this  transfer  function  is  plotted  out  to  a  spatial  frequency  of  5  cycles 
per  meter,  as  shown  in  Figure  2.7,  it  is  seen  that  the  quadratic  phase  dispersion  is  insignificant  for 
the  spatial  frequencies  and  propagation  distances  of  interest.  In  studying  Figure  2.7,  keep  in  mind 
that  the  strongest  turbulence  is  at  the  lowest  altitudes,  usually  within  ~  1  km  of  the  telescope.  For 
this  propagation  distance,  the  phase  dispersion  is  of  the  order  of  A/100  or  less.  For  this  reason,  as 
well  as  the  demonstrated  success  of  phase-only  correction  in  adaptive  optics  systems,  virtually  all 
adaptive  optics  calculations  in  the  literature  are  based  on  geometrical  optics. 

2-4  Adaptive  Optics  Imaging  Systems 

An  adaptive  optics  system  performs  two  main  functions:  wavefront  sensing  and  wavefront 
correction.  The  next  two  sections  discuss  these  functions  in  the  context  of  a  telescope  aperture 
with  diameter  D  larger  than  the  atmospheric  coherence  diameter  tq.  For  such  a  telescope,  one  may 
think  of  the  aperture  as  divided  up  into  ~  D^/r^  subapertures.  In  each  of  these  subapertures,  the 
wavefront  deviates  from  a  plane,  having  a  nonzero  slope  (tilt),  curvature,  and  higher  derivatives. 
The  objective  of  current  wavefront  sensing  and  correction  techniques  is  to  sample  the  wavefront 
in  the  aperture  with  a  period  <  ro,  measure  the  subaperture  tilts,  and  reconstruct  an  estimated 
wavefront  which  would  produce  the  measured  slopes. 
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Figure  2.6.  Phase  power  spectrum  for  several  values  of  tq. 
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Figure  2.7.  Argument  of  Fresnel  transfer  function  for  several  propagation  distances  2  (A  =  500 
nm.) 


2-21 


Following  the  sections  on  wavefront  sensing  and  correction.  Section  2.4.3  describes  some  of 
the  sources  of  error  in  adaptive  optics  systems.  Finally,  Section  2.4.4  reviews  some  of  the  latest 
experiments. 

2.4.1  Wavefront  Sensing.  For  a  phase  correcting  adaptive  optics  system,  the  WFS  must 
determine  the  wavefront  deviation  from  an  ideal  plane.  There  is  no  need  to  determine  the  mean 
phase  (known  in  the  adaptive  optics  community  as  “piston”)  since  it  has  no  effect  on  the  image. 

The  most  common  WFS  discussed  in  the  literature  is  the  Hartmann-Shack  (H-S)  WFS.  Shown 
in  Figure  2.8,  the  H-S  WFS  consists  of  a  lenslet  array  and  a  charge-coupled  device  (CCD)  grid  in 
the  focal  plane  of  the  lenslets.  The  H-S  WFS  requires  an  unresolved  point  source  for  best  results. 


If  the  lenslets  are  of  smaller  dieuneter  than  tq,  they  will  focus  a  diffraction-limited  spot  on  the  CCD 
photodetectors.  From  the  displacement  of  the  spot  from  its  ideal  position,  the  local  wavefront  slope 
can  be  calculated. 
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2.^.2  Wavefront  Correction.  At  this  time,  the  most  common  high-bandwidth  wavefront 
correction  device  is  the  continuously  deformable  mirror.  The  surface  of  the  DM  is  perturbed  to 
fit,  eis  much  a.s  possible,  the  original  aberrated  wavefront.  The  relative  amount  of  mirror  surface 
deflection  caused  by  a  single  aperture,  as  a  function  of  distance  from  the  actuator’s  position,  is 
called  the  actuator  influence  function  (29)  or  mirror  response  function  (63). 

The  simplest  type  of  DM  is  a  thin  plate  which  is  deformed  using  an  arrangement  of  electrome¬ 
chanical,  electromagnetic,  or  piezoelectric  actuators.  Thin  plate  DMs  have  a  wide  influence  function 
which  may  overlap  that  of  several  other  actuators,  complicating  the  mirror  control  problem  (29). 

A  type  of  mirror  with  a  narrower  influence  function  is  a  monolithic  mirror  which  incorporates 
the  reflector  surface,  actuators,  and  rigid  backplate  in  one  block  of  material  (29).  In  fact,  the 
interactuator  coupling  for  Litton/Itek’s  monolithic  piezoelectric  mirror  (MPM),  used  at  the  Air 
Force  Maui  Optical  Site  (AMOS),  can  be  adjusted  from  0  to  20%  (12). 

A  third  type  of  DM  with  a  continuous  reflecting  surface  is  the  membrane  mirror.  Tension 
is  maintained  on  the  membrane  to  keep  it  flat,  and  its  surface  is  deformed  with  electrostatic 
forces.  The  interesting  property  of  the  membrane  mirror  is  that  the  force  F  necessary  to  achieve 
a  given  displacement  Z  at  a  distance  r  from  the  center  is  proportional  to  the  Laplacian  of  the 
displacement  (10); 

(2.46) 

where  is  the  Laplacian  operator,  t  is  time,  and  T  is  the  tension/length.  Roddier  has  proposed 
a  wavefront  curvature  sensor  which  measures  the  Laplacian  of  the  wavefront  over  a  subaperture 
instead  of  its  slope  (53).  Its  output  could  be  applied  directly  to  a  membrane  mirror  with  no 
wavefront  reconstruction  necessary.  The  mirror  itself  would  perform  the  reconstruction  since  it 
solves  Poisson’s  equation  (equation  (2.46)).  Such  a  scheme  is  also  possible  with  bimorph  mirrors, 
which  are  constructed  by  gluing  the  reflective  surface  to  a  piece  of  piezoelectric  material  sandwiched 
between  electrodes  (56). 
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In  addition  to  continuous  facesheet  DMs,  there  is  renewed  interest  in  segmented  mirrors.  Each 
segment  of  this  type  of  mirror  is  independently  controllable  with  three  degrees  of  freedom:  tip,  tilt, 
and  piston;  there  is  no  problem  with  influence  functions  cross  coupling  with  other  segments  (34). 

2.4-3  Sources  of  Error  in  Adaptive  Optics  Systems.  The  four  chief  causes  of  systemic  error 
in  adaptive  optics  systems  are  measurement  noise  and  sampling  error,  associated  with  the  WFS; 
fitting  error,  associated  with  the  DM;  and  time  delay  error,  associated  with  the  control  loop.  The 
residual  aperture-average  phase  variance  associated  with  each  of  these  error  sources  (rad*  or  waves") 
is  designated  <7^,  fff ,  and  respectively.  If  these  errors  are  uncorrelated,  the  total  error  due 
to  system  imperfections,  <7^ ,  can  be  found  by  summing  the  variances; 

=  +  +  (2.47) 


Some  integrated  system  analyses  have  shown  that  the  combined  effect  of  various  error  sources  is  less 
than  that  indicated  by  the  quadrature  sum  (14).  This  systemic  wavefront  variance  is  in  addition 
to  residual  phase  error  due  to  anisoplanatism  (59). 


2.4.3.!  Measurement  Noise.  WFS  measurement  noise  arises  from  two  primary  sources; 
photon  noise  and  detector  noise.  Photon  noise  is  a  problem  when  WFS  meeisurements  are  made 
at  low  signal  levels.  The  fewer  the  number  of  photons  collected  by  the  subaperture  per  integration 
time,  the  less  accurate  will  be  the  estimate  of  the  centroid  of  the  focused  spot.  In  fact,  the  wavefront 
phase  measurement  error  variance  due  to  photon  noise  is  inversely  proportional  to  the  number  of 
photons  per  subaperture  per  integration  time,  A^ph: 


((rad/subaperture)^ ) 


(2.48) 
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Researchers  differ  as  to  the  exact  value  of  the  proportionality  constant  for  a  H-S  WFS.  Lincoln 
Laboratory  researchers  give  26.1  as  a  typical  value  (46),  Kern  ei  al.  u.se  19.7  (41),  Genrui  at. 
report  12.2  (24),  which  is  close  to  the  value  of  13.4  used  by  Gardner  ei  al.  in  their  calculations  (22). 

Previous  analyses  of  adaptive  optics  often  assumed  photon-limited  detectors,  that  is,  they 
assumed  that  photon  noise  dominates.  However,  in  practice,  Lincoln  Laboratory  researchers  have 
found  that  detector  noise  dominates.  Detector  noise  may  be  accounted  for  by  adding  a  second 
term,  proportional  to  to  equation  (2.48). 


2. 4-3. 2  Sampling  Error.  If  the  WFS  subapertures  were  vanishingly  small,  the  wave- 
front  could  be  measured  with  perfect  accuracy,  assuming  no  measurement  noise.  However,  due  to 
the  finite  size  of  actual  subapertures,  coupled  with  the  fact  that  only  the  wavefront  tilt  is  mea¬ 
sured,  there  will  be  residual  wavefront  error  resulting  from  the  sampling  process  even  in  noise-free 
conditions.  The  appropriate  Zernike-Kolmogorov  residual  error  formula  derived  by  Noll  gives 
for  a  WFS  that  measures  tilt  (48): 


0-2  =  0.134 


(rad^) 


(2.49) 


where  d  (m)  is  the  subaperture  diameter. 


2. 4.3.3  FitUng  Error.  The  term  “fitting  error”  describes  the  inability  of  the  DM  to 
perfectly  match  the  reference  wavefront,  due  to  the  mirror’s  finite  number  of  degrees  of  freedom. 
Hudgin  gives  a  general  expression  for  this  type  of  error  (32): 


cj  =  k[ 


(rad^) 


(2.50) 
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where  L  (m)  is  the  distance  between  actuators.  The  constant  Jkf  depends  on  actuator  geometry 
and  influence  functions.  Hudgin  calculated  a  value  of  —  1.302  for  a  square  array  of  pi.ston-only 
segments  while  Tyson  lists  a  value  of  0.2  for  a  new  type  of  DM  (59). 


2. 4-3. 4  Time  Delay  Error.  During  the  WFS  integration  time,  combined  with  the  time 
required  for  wavefront  reconstruction  and  mirror  response,  the  optical  turbulence  structure  changes 
slightly.  A  rule  of  thumb  for  time  delay  error  is 


crl  =  lcr 


(rad^) 


(2.51) 


where  u  (m/s)  is  the  transverse  wind  velocity  and  r  (s)  is  the  time  delay.  In  this  case,  the  propor¬ 
tionality  constant  is  a  function  of  the  control  algorithm  (59). 


2.4-4  Experimental  Results.  Evidently,  the  first  diffraction-limited  images  obtained  with  an 
adaptive  optics  system  were  made  in  January,  1976,  using  a  technique  called  image  sharpening  (8, 9). 
The  device  used  was  a  30  x  5  cm  aperture  telescope  which  employed  a  linear  arrangement  of  six 
piston-adjustable  mirrors  to  maximize  the  intensity  of  light  passing  through  a  narrow  slit.  The 
system  required  bright  objects  and  atmospheric  coherence  times  >  7-8  ms.  One  dimensional 
images  of  the  stars  Sirius  and  Arcturus  were  recorded  at  visible  wavelengths. 

Conducting  research  for  the  U.S.  military,  the  Massachusetts  Institute  of  Technology  Lincoln 
Laboratory  began  adaptive  optics  research  in  the  1970s.  In  1982,  the  first  compensated  star  im¬ 
ages  were  made  using  the  Compensated  Imaging  System  at  AMOS,  which  employs  a  168-actuator 
MPM  (28).  Performance  details  for  this  system  have  still  not  been  published  (31). 

It  was  not  until  October,  1989,  that  the  first  diffraction-limited  photographs  of  an  astronom¬ 
ical  object  were  produced  by  civilian  astronomers  (55).  The  components  of  this  prototype  system 
included  a  1.52  m  aperture,  an  H-S  WFS  with  5x5  subapertures,  a  DM  with  19  actuators,  and  a 
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tip-tilt  mirror  for  overall  tilt  compensation.  Images  of  Polaris,  binary  stars,  and  a  satellite  were 
made  using  infrared  light,  with  the  best  results  at  wavelengths  >  2.2/im. 

2.5  Laser  Guide  Stars 

As  discussed  in  Section  2.4.3. 1,  WFS  measurement  noise  for  a  photon-limited  detector  is 
inversely  proportional  to  the  number  of  detected  photons  per  subaperture.  This  result  implies 
that  there  is  a  minimum  required  level  of  brightness  for  a  reference  source  to  be  suitable  for  WFS 
me2isurements.  An  additional  requirement  for  a  WFS  reference  source  is  that  it  be  located  very  close 
to  the  object  of  interest,  so  that  the  light  from  the  reference  travels  through  the  same  atmospheric 
turbulence  structure  as  the  light  from  the  object.  It  turns  out  that  these  two  requirements  are 
so  stringent  at  visible  wavelengths  that  adaptive  optics  can  only  be  used  to  compensate  a  few  of 
the  brightest  natural  stars.  For  adaptive  optics  to  be  useful  to  astronomers,  a  means  of  generating 
a  suitable  WFS  reference  source  at  any  desired  location  is  necessary.  A  layer  of  sodium  (Na) 
atoms,  evidently  the  result  of  meteoric  ablation,  resides  in  a  region  of  the  atmosphere  known  as 
the  mesosphere  at  an  altitude  of  approximately  92  kra  (58).  These  atoms  can  be  made  to  fluoresce 
when  stimulated  by  a  laser  operating  at  a  wavelength  of  589  nm.  In  1985,  Foy  and  Labeyrie, 
unaware  of  ongoing  (at  the  time,  classified)  military  research,  first  proposed  to  the  astronomical 
community  the  concept  of  using  laser  stimulation  of  the  Na  layer  to  generate  reference  beacons  for 
wavefront  sensing  (15).  They  also  suggested  using  Rayleigh  scattered  laser  light  from  molecules  in 
denser,  lower  altitude  regions  of  the  atmosphere. 

There  are  three  problems,  directly  related  to  the  guide  stars  themselves,  facing  the  designer 
of  a  laser  guided  adaptive  telescope;  guide  star  brightness,  guide  star  size,  and  anisoplanatism. 
The  brightness  requirement  translates  to  a  laser  power  requirement,  which  is  currently  the  most 
significant  technological  limiting  factor  for  this  type  of  telescope.  For  the  most  accurate  WFS 
measurements,  the  laser  guide  stars  should  be  as  small  as  natural  stars,  that  is,  they  should  be 
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appear  to  the  telescope  as  an  unresolved  point  source  (angular  radius  <  1.22A/ro)  (58).  In  order 
to  approach  this  goal  as  close  as  possible,  the  laser  itself  may  be  projected  through  the  adaptive 
telescope  where  it  is  precompensated  by  the  DM  (22). 


Anisoplanatism  causes  two  deleterious  effects  which  must  be  accounted  for  in  the  design  of  a 
laser-guided  adaptive  optics  system.  The  first  effect  results  from  the  finite  height  of  the  reference 
stars.  As  shown  in  Figure  2.9,  there  is  an  angular  separation,  a,  between  the  optical  paths  from  the 
reference  and  object  for  virtually  all  points  in  the  aperture.  (In  this  figure,  the  object  is  assumed  to 
be  a  point  source.)  This  effect  is  known  as  focus  anisoplanatism,  because  the  rays  from  the  beacon 
and  object  seem  to  focus  at  different  points.  It  is  anticipated,  but  not  yet  demonstrated,  that  focus 


Figure  2.9.  Focus  anisoplanatism  due  to  finite  height  of  guide  stars. 


anisoplanatism  can  be  mitigated  through  the  use  of  multiple  guide  stars;  the  required  number  is 
determined  via  an  analysis  such  as  that  recently  completed  by  Welsh  and  Gardner  (64).  The  second 
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problem,  which  is  investigated  in  this  dissertation,  is  depicted  in  Figure  2.10.  In  this  case,  plane 


Light  from  object 
point  A 


Figure  2.10.  Illustration  of  FOV  problem;  Two  object  points  are  separated  by  an  angle  /?,  which 
may  be  greater  than  the  isoplanatic  angle. 


waves  from  the  object,  now  an  extended  source,  arrive  from  various  angles,  thus  traversing  different 
regions  of  turbulence  than  the  plane  wave  perpendicular  to  the  telescope  axis.  Hence,  even  though 
an  adaptive  optics  system  with  a  single  DM  may  employ  a  constellation  of  guide  stars  to  correct  the 
entire  aperture,  such  correction  is  only  effective  for  the  on-axis  plane  wave  and  those  from  points 
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with  zenith  angles  less  than  approximately  the  isoplanatic  angle.  Note  that  this  FOV'  problem  is 
also  applicable  to  adaptive  telescopes  which  use  natural  stars  for  wavefront  sensing. 

2.5.1  Expeumenial  Results.  As  of  April,  1991,  as  far  as  the  general  optics  and  astronomical 
communities  knew,  a  laser  guide  star  adaptive  telescope  for  visible  light  was  still  a  futuristic  concept. 
Thompson  and  Gardner  had  demonstrated  the  laser  guide  star  concept  in  1987  (58),  while,  as 
discussed  in  Section  2.2,  European  astronomers  had  obtained  diffraction-limited  infrared  imagery 
in  1989  (55).  These  two  accomplishments  represented  the  state  of  the  art  until,  in  May,  1991,  the 
Air  Force  declassified  the  results  of  a  decade  of  research  (20,  50).  They  announced  the  development 
of  a  1.5-meter  laser  guided  adaptive  telescope  with  which  a  number  of  turbulence-compensated 
astronomical  images  have  been  obtained. 

In  1983,  researchers  at  the  USAF  Phillips  Laboratory  made  the  first  wavefront  measurements 
using  Rayleigh  backscattered  laser  light  (20).  By  comparing  the  wavefront  measurements  using  the 
laser  guide  star  with  those  made  simultaneously  with  the  star  Polaris,  the  utility  of  the  laser  guide 
star  concept  was  demonstrated.  These  experiments  were  also  used  to  validate  the  theory  which 
predicted  the  magnitude  of  the  focus  anisoplanatism  error. 

In  1984-1985,  similar  experiments  were  conducted  by  Lincoln  Laboratory  using  sodium  guide 
stars  (35).  This  experiment  provided  the  first  empirical  data  showing  that  focus  anisoplanatism  is 
less  severe  for  a  sodium  guide  star  than  for  a  lower-altitude  Rayleigh  guide  star. 

Thompson  and  Gardner  were  the  first  civilian  researchers  to  field  test  the  laser  guide  star 
concept  (58).  In  1987,  they  produced  and  imaged  a  Na  guide  star  with  an  apparent  brightness 
of  M„  =  14.4  on  the  visual  magnitude  scale  used  by  astronomers.  This  guide  star  was  not  bright 
enough  for  wavefront  sensing  and  no  attempt  was  made  to  achieve  the  guide  star  size  criterion. 

The  first  astronomical  images  compensated  using  a  laser  guide  star  were  produced  in  1988 
by  Lincoln  Laboratory  at  AMOS  (50).  They  compensated  a  60  cm  telescope  with  an  open-loop 
adaptive  optics  system  consisting  of  256-8ubaperture  Hartmann  WFS,  a  241  actuator  DM,  and  a 
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Kayleigh  guide  star  at  an  altitude  of  4-8  km.  The  column  of  air  used  for  the  synthetic  beacon  was 
0.7.5- 1.0  km  long.  Measured  Strehl  ratios  for  the  star  Procyon  averaged  ~  0.3  at  visible  wavelengths 
vs.  ~  0.05  with  no  adaptive  optics  compensation. 

In  1989,  researchers  at  the  Phillips  Laboratory  aciiieved  closed  loop  adaptive  optics  com¬ 
pensation  on  a  1.5  m  telescope  using  a  guide  star  range  of  10  km  (20).  Full-width-half-rnaxinium 
resolution  of  0.18  arcsec  at  visible  wavelengths  was  achieved  in  2  arcsec  seeing  conditions.  This 
level  of  improvement  allowed  the  two  stars  of  the  binary  star  system  53  ^  Ursa  Majoris  (separation 
1.3  arcsec)  to  be  resolved  in  a  1-second  exposure. 

The  latest  significant  development  occurred  in  1990,  when  the  first  multiple  synthetic  beacon 
adaptive  optics  system  was  demonstrated  (65).  In  this  experiment,  the  wavefront  measurements 
from  two  Rayleigh  guide  stars  were  combined  to  compensate  an  image  of  the  star  Pollux.  Because 
only  a  60-cm  aperture  was  used,  the  reduction  in  focus  anisoplanatism  was  not  apparent.  However, 
this  experiment  served  to  demonstrate  the  feasibility  of  multiple  guide  star  systems. 

2.6  Summary 

Over  the  past  several  decades,  techniques  for  rigorous  ant  lysis  of  the  problem  of  imaging 
through  the  atmosphere  have  been  developed,  refined,  and  validated  with  experimental  results. 
Meanwhile,  advances  in  electronic  and  optical  hardware  have  resulted  in  successful  application 
of  this  theory  to  obtain  nearly  diffraction-limited  resolution,  over  small  fields,  with  ground-btised 
telescopes.  Increasing  the  corrected  field  of  view  of  these  telescopes  is  one  of  the  next  major 
problems  of  interest  to  the  community.  A  detailed  analysis  of  this  problem  begins  with  the  next 
chapter. 
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III.  Frequency  Domain  Analy.sis 


3.1  Introduction 

In  this  chapter  the  problem  of  estimating  the  contributions  of  different  parts  of  the  atmosphere 
to  optical  wavefront  aberration  is  addressed  from  a  frequency  domain  point  of  view.  Motivation  for 
a  frequency  domain  analysis  stems  from  Beckers’  suggestion  that  the  contribution  of  a  particular 
layer  of  the  atmosphere  to  the  total  phase  may  be  estimated  as  a  linear  combination  of  weighted  and 
shifted  wavefront  measurements  (5).  By  transforming  into  the  frequency  domain,  the  spatial  shifts 
become  multiplicative  complex  exponentials.  The  result  is  a  linear  system  of  equations  with  the 
layer  contributions  as  unknowns.  If  the  number  of  measurements  equals  the  number  of  turbulent 
layers,  the  system  may  be  inverted  to  obtain  the  layer  contributions.  Using  Figure  3.1,  this  approach 
will  be  illustrated  for  the  case  of  a  2-layer  atmosphere  with  noise-free  measurements. 

In  this  chapter,  it  is  assumed  that  the  WFS  can  directly  measure  the  total  phase  aberration 
function  in  the  aperture  plane,  where  m  indicates  the  reference  source  (guide  star).  In 

the  example  illustrated  in  Figure  3.1,  two  such  measurements  are  made,  first  with  guide  star  1  and 
then  with  guide  star  2.  It  is  assumed  that  the  atmosphere  does  not  change  at  all  during  the  time 
it  takes  to  make  the  two  measurements.  Using  geometrical  optics  and  assuming  the  guide  stars  are 
at  an  infinite  altitude,  the  two  measurements  may  be  written 

V't,(x)  =  +  Z\ji)  +  +  Z\ji)  (3.1) 

=  V'i,(?-f  21,^2)  +  +  ^b^z)  (3.2) 

where  6m  is  the  angular  position  of  guide  star  m,  is  the  altitude  of  turbulent  layer  f,  and 
is  the  contribution  of  turbulent  layer  i  to  the  total  phase  aberration.  Throughout  this  dissertation, 
this  contribution  is  referred  to  as  wavefront  “component”  i.  Note  that  a  small  angle  approximation 
was  used  in  writing  the  preceding  equations.  Transforming  the  this  system  of  equations  into  the 
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frequency  domain: 


i’tAf)  =  exp(j27r2|,/tfi)V'i,(/)  +  exp(i2xc:i,/-^i)V'i2(/)  (3.3) 

=  exp(i2:r2i./- 02)^1, (/)  +  «‘xp(j27r2i,/-^2)V’i,(/)  (3.4) 

where  the  tilde  indicates  the  Fourier  transform.  By  inverting  the  preceding  system,  the  transform  of 
the  turbulent  layer  contributions  may  be  obtained  in  terms  of  the  transforms  of  the  measurements. 
For  turbulent  layer  1  the  result  is 


i'uif)  =  ^Wl.(/)V’t,(/)  +  ^W,j(/)t(’t2(/) 


(3.5) 


where 


^w,,(/ ) 


Aw.,(/) 


_ exp(j2T2|  J  •  62) _ 

exp[j2T/ •  (21,^1  +  2|j^2)]  -  exp[j2;r/ •  (21,^2  + 

_ -exp(j27r2|^/gi) _ 

exp[i2?r/.  (21, +  21,^2)]  -  exp[;2;r/-  (21,^2  +  zx^Bi)] 


(3.6) 

(3.7) 


The  magnitude  of  these  complex  weights  is 


|Aw,i  I  —  l^wial  — 


_ 1 _ 

2sin|x  (21,  -  2j,)/ •  (^2  -  1^1)  } 


(3.8) 


From  the  preceding  equation,  it  is  seen  that  these  weighting  functions,  hw,^(/ ),  have  a  singularity 
at  the  origin.  As  a  result,  the  low  frequency  components  of  any  noise  in  the  measurements  will  be 
amplified  and  the  estimates  of  the  turbulent  layer  contributions  will  be  poor.  To  deal  with  noisy 
measurements,  a  minimum  mean  square  error  filtering  approach  is  taken  in  this  chapter.  Assumed 
a  fTiori  knowledge  statistics  of  the  phase  and  noise  random  processes  are  used  to  obtain  optimum 
weights. 
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Several  simplifying  assumptions  are  necessary  in  order  to  analyze  this  problem  in  the  fre¬ 
quency  domain.  As  in  the  preceding  example,  it  is  assumed  throughout  this  chapter  that  measure¬ 
ments  of  the  total  phase  are  available,  which  is  not  usually  true  in  practice.  Also,  the  finite  size  of 
the  telescope  aperture  is  not  taken  into  account.  Adaptive  optical  components  such  as  the  WFS 
and  tilt  removal  system  are  modeled  approximately  as  linear  spatial  filters.  Finally,  phase  statistics 
are  described  with  a  Gaussian  correlation  function  instead  of  the  Kolmogorov  structure  function. 
Because  of  these  assumptions,  and  because  the  analysis  is  mathematically  cumbersome,  the  results 
of  this  frequency  domain  analysis  are  more  suitable  for  qualitative  understanding  of  the  estimation 
problem  rather  than  for  quantitative  performance  predictions.  All  of  the  simplifying  '•^sumptions 
listed  in  this  paragraph  are  dropped  in  the  spatial  domain  analysis  of  Chapter  IV. 

In  Section  3.2,  the  atmospheric  model  used  in  this  chapter  is  presented  while  Section  3.3 
outlines  the  geometry  of  the  problem.  Section  3.4  defines  linear  models  for  wavefront  propagation, 
piston  and  overall  tilt  removal,  wavefront  measurement,  and  wavefront  component  estimation.  The 
analysis  is  performed  in  Sections  3.5  and  3.6,  while  the  performance  of  the  estimator  is  evaluated  for 
several  choices  of  parameters  in  Section  3.7.  Finally,  an  example  problem  is  worked  in  Section  3.8 
and  a  summary  is  in  Section  3.9. 

3.S  Atmospheric  Model 

Experimental  evidence  indicates  that  optical  turbulence  in  the  atmosphere  is  often  concen¬ 
trated  in  a  small  number  of  layers  (52,  61).  In  a  recent  paper,  McKechnie  presented  a  theoretical 
framework  for  a  layered  atmospheric  model,  in  which  he  collapsed  the  effects  of  each  layer  into  a 
phase  screen  (45).  Based  on  these  experimental  and  theoretical  results,  the  validity  of  an  atmo¬ 
spheric  model  consisting  of  /  statistically  independent  phase  screens  is  assumed.  As  indicated  in 
Chapter  I,  it  is  assumed  that  geometrical  optics  may  be  used  to  describe  the  propagation  of  light 
through  down  through  the  atmosphere  at  the  observatory  site. 
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As  described  in  the  previous  section,  a  phase  perturbation  function  ‘4’i,{x)  is  associated  with 
each  phase  screen  in  this  atmospheric  model.  This  function  is  assumed  to  be  a  sample  function  of 
a  zero-mean,  spatially  stationary  random  process,  described  by  a  phase  variance,  (rad"),  and 
spatial  correlation  function 

r)>  (3-9) 

where  r  =  |r  |  (m)  is  the  distance  between  two  points  in  the  plane  and  angle  brackets  indicate 
ensemble  averaging.  Following  Barakat  and  Beletic  (3),  it  is  assumed  that  the  spatial  statistics  of 
the  phase  screens  may  be  described  by  Gaussian  correlation  functions: 

r^,  (r)  =  exp  (rad^)  (3.10) 

The  parameter  pi,  (m)  is  the  wavefront  correlation  length  associated  with  each  phase  screen.  Also 
of  interest  is  the  power  spectral  density  (PSD)  of  the  phase  screen, 


if)  =  Pu  exp(-irVu/")  ((rad-m)2) 


(3.11) 


which  is  the  Fourier  transform  of  the  autocorrelation  function  with  /  (m  ^)  being  the  spatial 
frequency  variable. 


Some  researchers  may  question  the  validity  of  the  Gaussian  autocorrelation  function  (3.10) 
in  describing  atmospheric  statistics.  Normally  the  Kolmogorov  structure  function,  equation  (2.43) 
is  used,  because  it  is  well  supported  by  experimental  data.  However,  for  proper  choice  of  the 
parameters  and  pi,,  the  structure  function  which  corresponds  to  the  Gaussian  correlation 
function  of  equation  (3.10), 


T>G{r)  = 


1  —  exp 


(3.12) 
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is  similar  in  shape  except  it  approaches  a  finite  variance,  in  contrast  to  the  infinite  variance  as¬ 
sociated  with  Kolmogorov  statistics.  In  this  sense,  the  Gaussian  autocorrelation  function  is  more 
physically  intuitive.  By  equating  expressions  (2.43)  and  (3.12)  at  the  point  r  =  p\^,  the  following 
formula  relates  tq  and  pi, : 


ro  =  2.76 


PI. 

6/5 


(3.13) 


Figure  3.2  is  a  representative  plot  of  the  two  structure  functions  related  by  equation  (3.13),  out  to 
an  approximate  outer  scale  of  1  meter.  Strictly  speaking,  the  Kolmogorov  structure  function  is  not 
valid  for  points  separated  by  a  distance  greater  than  the  outer  scale.  It  may  be  argued  that  the 
Gaussian  structure  function  is  as  good  a  model  as  any  for  larger  scale  sizes,  given  the  present  state 
of  knowledge.  It  should  be  noted  that  the  correlation  length,  pi,,  and  the  atmospheric  coherence 
diameter,  tq,  describe  different  wavefront  characteristics  (60). 


0.1  0.2  0.3  0.4  0.5  0.6  0.7  0.8  0.9  1 

r  (m) 

Figure  3.2.  Comparison  of  Kolmogorov  and  Gaussian  structure  functions 


It  is  useful  to  relate  the  strength  of  each  turbulent  layer  to  ro,  the  most  common  measure 
of  overall  seeing  conditions.  In  Chapter  II  it  was  shown  that  for  zenith  viewing  at  a  particular 
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wavelength, 


=  l  r  d^CliO 

JQ 


(3.14) 


where  7  is  a  constant  and  r  is  the  upper  bound  of  the  turbulence  region.  If  the  turbulence  region 
is  divided  into  K  contiguous  slabs, the  preceding  equation  may  be  rewritten  as  follows: 

1=1 

t=l 

where  slab  k  lies  between  the  altitudes  Zk-\  and  zt-  The  definition 

-3/5 

(3.16) 

is  due  to  Beckers  (5).  In  addition,  McKechnie  showed  that  the  total  wavefront  correlation  function  is 
equal  to  the  sum  of  the  individual  correlation  functions  (45).  Using  this  result  and  equation  (3.15), 
the  statistical  averages  of  of  the  overall  atmosphere  can  be  related  to  those  of  the  component  layers. 

3.3  Geometry 

Figure  3.3(a)  shows  two  of  M  reference  sources  located  well  above  the  optical  turbulence 
region  of  the  atmosphere,  of  which  two  of  I  turbulent  layers  are  depicted.  It  is  assumed  that 
the  altitude  of  the  reference  sources  is  high  enough  that  the  waves  emanating  from  them  can  be 
considered  planar.  The  reference  stars  are  somehow  multiplexed  so  that  all  M  phase  perturbation 
measurements  are  made  within  the  atmosphere’s  correlation  time  (27),  using  a  single  WFS.  The 
problem  is,  based  on  such  measurements,  to  determine  the  phase  perturbation  function,  ip\^{x), 
associated  with  each  layer.  The  coordinate  system  used  for  this  analysis  is  shown  in  Figure  3.3(b). 
The  position  of  each  reference  star  is  described  by  a  two-dimensional  vector  0^  =  (^m*  .^m,).  The 
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components  of  this  vector  are  angular  offsets  from  the  telescope  (2)  axis  in  the  x  and  y  directions 
and  are  related  to  the  zenith  angle  6  by 


^  =  tan  ^  (tan“  +  tan' 


(3.17) 


(b) 


Figure  3.3.  Location  and  geometry  of  reference  sources. 


Recall  that  in  this  chapter  the  implications  of  a  finite  telescope  aperture  are  not  being  ad¬ 
dressed.  As  discussed  in  Chapter  IV,  an  important  consideration  is  that  the  guide  stars  must  be 
close  enough  together  to  project  a  large  common  portion  of  each  turbulent  layer  onto  the  aperture. 


3.4  Signal  Processing 

In  the  following  sections,  a  linear  model  for  wavefront  propagation,  overall  tilt  removal,  wave- 
front  measurement,  and  wavefront  component  estimation  is  presented.  The  model  assumes  that 
the  wavefront  sensor  output  is  a  continuous  representation  of  the  wavefront  rather  than  a  discrete 
array  of  slope  measurements.  This  approach  avoids  the  numerical  complexities  of  large  matrix  cal- 
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culations  {(13,  63)  and  Chapter  IV)  and  allows  intuitive  understanding  of  the  wavefront  component 
estimation  problem.  The  various  functions  performed  by  the  adaptive  optics  system  are  modeled 
as  spatial  filtering  operations  (59),  allowing  insight  to  be  gained  into  the  effect  of  different  system 
parameters  on  the  accuracy  of  the  estimates.  The  analysis  of  Chapter  IV  follows  a  more  rigorous 
formalism  along  the  lines  of  reference  (63). 

3.4-  J  Wavefront  Propagation.  Using  a  small  angle  approximation  and  referring  to  Fig¬ 

ure  3.3,  it  can  be  seen  that  the  plane  wave  from  reference  source  m  projects  V’i,(^)  onto  the 
aperture  plane,  shifted  by  an  amount  dependent  on  the  layer  altitude  and  guide  star  position.  As¬ 
suming  the  validity  of  the  geometrical  optics  approach,  the  total  perturbation,  ),  of  a  plane 

wave  from  reference  source  m  propagating  through  I  layers  is 

/ 

V’t„(l)  =  +  (3.18) 

»=1 

3.4-  S  Piston  and  Overall  Tilt  Removal.  The  average  value  of  the  phase  aberration  function 
over  the  area  of  the  telescope  aperture  is  known  as  the  piston  mode  of  the  wavefront.  The  piston 
mode  is  not  considered  an  aberration  and  does  not  need  to  be  estimated  and  compensated  by 
an  adaptive  optics  system.  To  obtain  accurate  results  in  adaptive  optics  calculations,  the  piston 
mode  must  be  removed  from  all  wavefront  models.  Otherwise,  adaptive  optics  system  performance 
predictions  may  be  too  pessimistic  because  piston  estimation  and  correction  errors  are  included. 
In  addition  to  the  piston  mode,  the  wavefront  intercepted  by  the  telescope  aperture  will  almost 
always  have  a  significant  nonzero  aperture-averaged  gradient.  Usually,  an  adaptive  optics  system 
removes  this  overall  tilt  with  a  separate  feedback  mech2uiism  from  the  DM.  One  reason  for  this 
approach  is  that  most  deformable  mirrors  lack  the  dynamic  range  to  remove  substantial  overall 
tilt  components.  In  keeping  with  the  linear  systems  approach,  overall  piston  and  tilt  removal  is 
approximated  by  attenuation  of  the  lowest  spatial  frequencies  in  the  wavefront.  Tyson  finds  that  in 
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such  a  spatial  filtering  method,  tilt  is  seen  as  having  a  spatial  frequency  content  of  “about  1  /2  cycle 
per  mirror  diameter  (59).”  Since  the  amount  of  piston  and  overall  tilt  depends  on  the  aperture 
size,  the  piston  and  tilt  removal  operation  is  modeled  as  a  convolution  of  the  total  wavefront  with 
an  impulse  response  function  h^(x)  which  depends  on  the  effective  telescope  aperture.  Letting  the 
notation  ip(x)  represent  a  tilt-removed  wavefront, 

I 

i  =  l 
1 

=  51  +-"'/">)  (3.19) 

«=i 

where  ♦  indicates  two-dimensional  convolution  and  ip\\x)  =  i’\,(,x)  *  h^(x)  is  the  piston-  and  tilt- 
removed  phase  of  the  itfa  layer.  The  function  <pt„,(x)  represents  the  wavefront  to  be  measured. 
The  remainder  of  the  analysis  deals  only  with  piston-  and  tilt-removed  wavefronts. 

3.4  3  Wavefront  Measurement.  Typically,  wavefront  sensors  used  in  practical  adaptive  op¬ 
tics  systems  measure  the  gradient  of  the  total  wavefront  instead  of  the  actual  phase.  The  estimate 
of  the  wavefront  is  then  reconstructed  on  the  surface  of  the  DM  via  a  summation  of  the  influence 
functions  which  describe  the  spatial  response  of  each  mirror  actuator.  In  this  chapter,  these  two 
processes  are  combined  and  the  adaptive  optics  system  is  modeled  as  measuring  the  wavefront 
phase  directly.  This  wavefront  “measurement”  will  have  two  chief  sources  of  error:  sampling  error 
and  measurement  noise.  Both  of  these  error  sources  are  related  to  the  size  of  the  WFS  subapertures 
in  an  adaptive  optics  system. 

3.4.3. 1  Sampling  Error.  Actual  wavefront  sensors  usually  divide  the  telescope  aper¬ 
ture  into  an  array  of  subapertures,  and  the  wavefront  slope  is  measured  over  each  subaperture. 
Because  of  the  finite  size  of  the  subaperture,  the  WFS  cannot  measure  the  phase  gradient  at  a 
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single  point.  Instead,  the  WFS  determines  the  average  gradient  over  the  subaperture.  Similarly, 
the  DM  cannot  reconstruct  the  highest  spatial  frequencies  in  the  wavefront  due  to  the  spatial  sepa¬ 
ration  of  the  actuator  positions,  which  is  approximately  equal  to  the  size  of  the  WFS  subapertures. 
These  inaccuracies  are  modeled  as  a  sampling  error  in  the  wavefront  measurement  process.  In  this 
analysis  these  spatial  limitations  are  described  with  an  impulse  response  function,  hs^(x),  related 
to  the  subaperture  size,  so  that  the  output  of  the  wavefront  measurement/reconstruction  system 
can  be  written 

=  (3.20) 

Because  of  the  fact  that  actual  wavefront  sensors  measure  the  average  wavefront  gradient  over  the 
subaperture,  the  form  of  han{x)  is  chosen  to  implement  subaperture  wavefront  averaging.  Hence, 
ifitnix)  represents  the  measured  wavefront,  with  the  overbar  indicating  the  average  phase  over  the 
subaperture.  In  this  model,  the  subapertures  are  not  fixed  to  a  grid  of  points,  rather,  a  continuous 
output  is  assumed.  As  indicated  previously,  this  approximation  simplifies  the  analysis  by  allowing 
the  analysis  to  be  conducted  in  terms  of  continuous  functions. 

3. 4-3. 2  Measurement  Noise.  Due  primarily  to  photon  noise,  the  WFS  measurements 
will  be  somewhat  in  error.  This  effect  is  modeled  with  a  continuous,  zero  mean  random  process, 
n{x)  (rad),  which  is  independent  of  the  random  processes  In  an  actual  wavefront  sensor, 

the  measurement  noise  is  uncorrelated  from  subaperture  to  subaperture  (62).  For  this  reason,  the 
noise  is  not  expected  to  contribute  significantly  to  the  lowest  spatial  frequencies  in  the  wavefront 
aberration  function.  Hence,  the  noise  is  modeled  as  white,  except  for  attenuation  of  the  lowest 
spatial  frequencies  according  to  the  size  of  the  telescope  aperture: 

Jl{x)-n{x)*h„(x)  (3.21) 

where  Tt{x)  (rad)  is  the  “tilt-removed”  noise. 
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To  complete  the  model  of  the  wavefront  sensor,  the  second  order  statistics  of  the  noise  need 
to  be  further  specified  by  selecting  an  appropriate  value,  M ,  for  the  constant  noise  power  spectral 
density.  If  the  mean  square  error  in  the  WFS  measurements  is  (rad"),  then  the  corresponding 
noise  power  density  level  is 

2 

- - -  (3.22) 

J  d/lM/)P|fisa(/)|' 

where  h  indicates  the  Fourier  transform  of  h  and  /  =  (frify)  (rn'M  is  a  spatial  frequency  vector. 
The  tilt-removed  noise  PSD  is  then 

<f,(/)  =  Ar|M/)|"  ((rad-m)2)  (3.23) 

The  final  output  of  the  measurement  system,  (p[^{x),  is  the  average  noise-contaminated 
wavefront  over  a  subaperture: 


^L(*)  =  +  (3.24) 

where  Tfm(x)  is  the  noise  in  the  mth  wavefront  measurement. 

3.4-4  Component  Estimation.  Beckers  suggested  collecting  an  ensemble  of  wavefront  mea¬ 
surements  given  by  equation  (3.24),  spatially  shifting  them  so  that  the  component  from  a  particular 
turbulent  layer  will  be  properly  aligned  in  each  measurement,  and  then  averaging  the  ensemble. 
He  expected  that  the  result  of  such  an  averaging  process  will  be  strongly  weighted  toward  the 
contribution  from  the  turbulent  layer  of  interest,  with  contributions  from  other  layers  tending  to 
average  out.  This  idea  is  generalized  to  a  goal  of  finding  the  optimum  combination  of  amplitude 
weighting  and  shifting  which  needs  to  be  applied  to  each  measurement  in  the  linear  combination. 
Since  amplitude  weighting  and  spatial  shifting  are  linear  shift-invariant  operations,  the  estimate  of 
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)  takes  the  following  form: 


M 


■Pi 


,(f)  = 


m=l 

Af 

c 

m=:l 


=  E 


N 


Y',  +  Z\,.^m)  +  r)m(x) 


.i‘  =  l 


*  hsai^)  *  ^W,m(^) 


(3.25) 


where  hy,,^{x)  represents  the  amplitude  weighting  and  shifting  applied  to  wavefront  sensor  mea- 
s’lrement  m  in  the  estimate  of  wavefront  component  The  task  at  hand  is  to  find  the  optimum 

set  of  functions  )• 


3.5  Mean  Square  Error  Calculation 

The  goal  is  to  minimize  the  mean  square  error  in  the  estimate.  Using  angle  brackets  to  signify 
expectation,  the  mean  square  difference  between  the  estimated  and  actual  wavefront  phase  from 
the  tth  turbulent  layer  is  defined  as: 

e^(x)  =  (bi.C*)- W.(i)f) 

=  +  (<^P.(*)>  -  2  {^i.(x)<Pi.(x)) 

=  r^..(0)  +  r^..(0)-2r^,,^,,(0) 

=  +  J 2  J d/%,^.,(/)  (3.26) 

The  double  subscripts  indicate  cross  correlation  functions  or  cross  power  spectra,  as  appropriate. 
If  it  assumed  that  the  second  order  statistics  (i.e.  )  of  each  phase  screen  are  known  a  priori, 

then  the  quantities  in  equation  (3.26)  can  be  calculated  (i.e.  F,^,^,  ,  F,^,  ,  ,  and 
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$01  ).  Using  equation  (3.25),  the  autocorrelation  function  of  <^i,{x  )  is  found: 


M  (  M  I 

=  ^  +  2l.,(Cn  „(■?)  +  /»«-. „,(-a")  + 

m=l  l.m'  =  l  i'  =  l 

I'll  (^  )  *  ^W,m  \  )  *  ^sa(  ^  )  (3  -2  <  ) 


In  arriving  at  equation  (3.27),  the  mutual  independence  of  the  random  process  rp\,{^)  and  n„i{x) 
was  used.  The  derivation  of  this  expression  is  found  in  Appendix  A. 

The  Fourier  transform  of  equation  (3.27)  is  the  power  spectral  density  of  <^\,{x): 


M 


|Asa(/)P  E 


m=l 


PA/  / 

I 

,m'  =  l  «'=1 


(3.28) 


In  the  above  equation,  him  =  exp[;2?rzi,(^m  •  /)].  and  h'  is  the  complex  conjugate  of  h.  Having 
derived  equation  (3.28),  it  is  less  difficult  to  obtain  the  cross  power  spectrum  ): 

M 

)  =  %,,(fyhs.(f)  Y,  him(/)hw.„(/)  (3.29) 

m=l 

The  power  spectra  in  (3.28)  and  (3.29)  may  be  substituted  in  equation  (3.26)  to  obtain  a  general 
expression  for  the  mean  squared  error  in  estimating  the  wavefront  contribution  of  one  of  I  turbulent 
layers  using  measurements  made  from  M  different  reference  sources. 


3.6  Derivaiion  of  Optimum  Weights 

For  illustration  purposes,  the  case  of  two  turbulent  layers  and  two  reference  sources  is  con¬ 
sidered.  The  calculations  can  be  generalized  for  M  >  /  >  2,  but,  as  the  previous  section  indicates. 
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loiigtliy  <'X;»r"ssi()ti!s  arc  involved.  LcUing  M  =  I  z:i  '2,  tiic  general  resnlls  of  l.lie  iireceding  seel  ion 
are  used  l.o  obtain  the  nii‘an-s(|uar('  error  in  tlie  i‘stiinalion  for  this  e;ise; 


I 1  +  j + 

J J  (Ifrdfy  l/isiil"(l/twur"  +  + 


(;{.:$()) 


Iler<',  tin'  fact  that  =  1  w;is  used,  and  tlie  (fr,fy)  <leiM'ndenc('  was  suppressed  to  shorten 

the  expression.  'I'he  goal  is  to  find  /iw,,  and  hv/fj  miiiiniize  < 'lb  sitni)lify  the  niatheniatics  five 
assumptions  ar(‘  made: 


= 

(3.31) 

= 

-e-i 

(3.32) 

= 

2:,. 

(3.33) 

hsa(/ )  = 

1 

/r  t  /y  <  /sa 

(3.34) 

0 

otherwise 

/*a(/ )  =  ^ 

0 

Jxi  fy  fa 

(3.35) 

1 

otherwise 

Assumption  (3.31)  is  not  realistic  for  typical  atmospheric  conditions.  Usually,  the  turbulent  layer 
nearest  the  ground  is  stronger  than  the  one  at  higher  altitude.  However,  using  this  assumption,  the 
results  should  represent  a  median  value  for  c.  That  is,  the  results  are  expected  to  be  better  than 
if  one  were  trying  to  estimate  the  wavefront  due  to  the  weaker  of  a  pair  of  turbulent  layers,  but 
worse  than  if  one  were  estimating  that  of  the  stronger  of  the  two.  Assumptions  (3.32)  and  (3.33) 
allow  the  use  of  trigonometric  identities  to  simplify  the  mathematics.  Physically,  these  assumptions 
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specify  that  the  guide  stars  are  placed  symmetrically  about  the  optical  axis  and  that  the  highest 
turbulent  layer  is  located  at  twice  the  altitude  of  the  lowest.  A  consequence  of  assumption  (3.34) 
is  that  the  impulse  response  of  the  WFS  is 


Asa(i,  y)  -  4/sa 


sin(27r/saa?)  sin(27r/say) 


(3.36) 


A  measure  of  the  width  of  /isa(*)  is 


(3.37) 


The  parameter  d  is  equivalent  to  the  subaperture  spacing  in  a  WFS.  Assumption  (3.35),  models 
piston  and  tilt  removal  as  the  attenuation  of  the  lowest  spatial  frequencies  in  the  wavefront  random 
process.  The  cutoff  frequency  /a  is  related  to  the  aperture  diameter  Z?  in  a  manner  similar  to 
equation  (3.37): 


(3.38) 


Using  the  assumptions  given  in  (3.31)  through  (3.35),  a  somewhat  simpler  expression  for  the 
mean  square  error  is  obtained: 

+  Re{hw..h;„(h?i  +  ht:)}  - 
(hwnhii  +  +  (|hw..P  +  Ihw.J")  (3.39) 

Appendix  A  contains  the  details  of  this  simplification.  Having  simplified  the  mean  square  error 
expression,  the  next  step  is  to  look  for  hw,,  and  h*,,  to  minimize  f^. 

From  inspection  of  equation  (3.39),  it  is  seen  that  it  is  necessary  that 


= 


+ 


/;sa  r/sa  I 

/  df,dfyl2% 

•  /«A  J  —  I 


Im{hwii  fill  d"  fiwijhii }  —  9 


(3.40) 
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since  the  mean  square  error  calculation  must  result  in  a  real  number.  The  above  requirement  leads 
to  a  coupling  between  the  functions  hw,,  and  Awu- 

+  ^wis}  (3.41) 

To  obtain  the  optimum  functions,  equation  (3.41)  is  substituted  into  (3.39),  and  the  following 
system  of  equations  is  solved  to  get  the  real  and  imaginary  parts  of  each  weight: 


dl  _  dl  _  dl 
aim{hw„}  aRe{hw,j}  91m{hw,J 


(3.42) 


where  I  represents  the  term  inside  the  large  curly  braces  in  equation  (3.39).  The  real  part  of  ft*,, 
is  then  obtained  from  (3.41).  A  symbolic  mathematics  software  package  was  extremely  useful  in 
performing  the  calculations  and  simplifying  the  results  (44).  With  its  aid,  the  real  and  imaginary 
parts  of  the  optimum  weights  are  found  to  be 


Re{hw,,}  =  Rejftwtj}  = 


Re{hii}  +  4Im^{ftu}<l>^,^ 

+4Im^{hii}$2_^ 


(3.43) 


Im{An}  (2-4Re“{ftu})$2 

Im{/i*„}  =  Im{/iwiJ  = - — —  - ■■  2,7  7^2 - 

^2  -I-  4Im''{/iii}$2_^ 


(3.44) 


Substitution  of  equations  (3.43)  and  (3.44)  into  (3.39)  and  further  use  of  computer  algebra 
software  leads  to  the  final  expression  for  the  mean  square  error  in  estimating  the  wavefront  due  to 
one  of  two  turbulent  layers  under  assumptions  (3.31)  through  (3.35): 


(3.45) 
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3. 7  Performance  Plots 


The  performance  of  the  estimator  may  be  evaluated  by  substituting  different  values  of  wave- 
front  variance  er^,^  ,  measurement  noise  variance  wavefront  correlation  length  pi,,  wavefront 
sensor  spatial  bandwidth  /sa,  tilt  bandwidth  /a,  turbulent  layer  altitude  ri,,  and  guide  star  posi¬ 
tion  $1  into  equation  (3.45).  Results  of  the  computations  are  plotted  in  Figures  3.4  through  3.8. 
Each  figure  is  intended  to  illustrate  the  effect  of  varying  a  different  parameter  on  the  accuracy  of 
the  estimate,  with  the  other  parameters  held  constant.  The  curves  are  plotted  against  a  “seeing- 
to-error”  (SER)  ratio; 

SER  =  201og^  (3.46) 

The  SER  ir  intended  to  show  how  much  of  the  measured  phase  fluctuations  are  due  to  the  seeing 
layer  vs.  the  measurement  error.  The  term  “signal-to-noise”  ratio  is  avoided  since  intensity  levels 
are  not  being  compared;  however  the  SER  plays  a  similar  role  in  this  estimation  problem. 

3. 7.1  Tilt  Removal.  Tyson  modeled  an  adaptive  optics  system  as  a  two-dimensional  spatial 
filter.  As  mentioned  previously,  he  found  that  tilt  may  be  viewed  as  having  a  spatial  frequency 
content  up  to  about  1/2  cycle  per  telescope  diameter  (59).  With  this  guideline,  a  range  of  values  for 
tilt  bandwidth,  /»  was  selected  and  the  results  plotted  in  Figures  3.4  and  3.5.  For  the  correlation 
length,  the  value  pi,  =  0.845  m  was  used.  This  value  was  employed  in  Section  3.2  to  match  the 
Gaussian  structure  function  to  the  Kolmogorov  structure  function  with  tq  =  0.2  m  and  outer  scale 
of  1  m.  As  illustrated  in  Figure  3.4,  eliminating  the  lowest  spatial  frequencies  helps  because  the 
weighting  functions  fiw.„  have  large  magnitudes  for  low  spatial  frequencies,  since  most  of  the  power 
in  the  phase  fluctuations  is  concentrated  in  this  region.  Note  that  Figure  3.4  shows  the  rms  error  in 
the  estimate  normalized  by  the  tilt-removed  rms  variation,  .  As  the  lowest  spatial  frequencies 
are  filtered  out,  the  rms  phase  variation  decreases.  This  normalization  shows  the  improvement  in 
the  performance  of  the  estimator  alone,  since  it  compares  the  rms  error  in  the  estimate  to  the  rms 
phase  deviation  in  the  wavefront  component  that  is  actually  in  the  measurement.  Figure  3.5  shows 
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the  same  data  normalized  by  the  non-tilt-removed  rms  variation,  .  This  second  plot  is  included 
to  illustrate  the  benefits  of  tilt  removal  to  overall  system  performance.  In  this  dissertation,  the 
primary  interest  is  in  the  performance  of  the  wavefront  component  estimator,  so  the  plots  discussed 
in  the  next  section  are  normalized  by  the  tilt-removed  rms  variation,  . 


Figure  3.4.  Estimator  performance  as  function  of  tilt  bandwidth  (p),  =  0.845  m,  =  (10“*,0) 
rad,  fsa,  =  5  m~S  zi,  =  5000  m).  Results  normalized  by  tilt-removed  phase  variance. 

5.7.2  WFS  Bandwidth.  Figure  3.6  shows  that  the  error  performance  improves  as  the  spa¬ 
tial  bandwidth  of  the  wavefront  sensor  is  increased.  An  increase  in  the  WFS  spatial  bandwidth 
corresponds  to  a  decrease  in  the  size  of  the  subaperture  according  to  equation  (3.37).  As  the  wave- 
front  is  sampled  with  smaller  apertures,  the  high  frequency  components  in  the  random  process 
are  measured  more  accurately.  The  performance  improvement  is  asymptotic  because  as  the  WFS 
bandwidth  increases,  eventually  all  of  the  significant  power  in  the  ipi,  random  process  is  contained 
within  the  passband. 

3.7.3  Phase  Correlation  Length.  Figure  3.7  shows  how  the  performance  of  the  estimator 
varies  as  the  width  of  the  wavefront  autocorrelation  function  is  varied.  Performance  improves 
as  pi,  increases  since  a  wider  autocorrelation  function  corresponds  to  a  narrower  power  spectrum. 
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Figure  3.5.  Estimator  performance  as  function  of  tilt  bandwidth  (pi,  =  0.845  m,  =  (10“®,0) 
rad,  /sa  =  5  m~^  z\^  =  5000  m).  Results  normalized  by  non-tilt-removed  phase 
variance. 


Figure  3.6.  Estimator  performance  as  function  of  WFS  spatial  bandwidth  (pi,  =  0.845  m,  6i  = 
(10-^0)  rad,  /*  =  0.5  m"!,  zi,  =  5000  m) 
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Hence,  as  pi,  increases  more  of  the  significant  power  in  the  random  process  falls  within  the  passband 
of  the  WFS  transfer  function  Asa(/  )•  This  effect  is  asymptotic  for  the  same  reason  as  discussed  in 
the  previous  paragraph. 


Figure  3.7.  Estimator  performance  as  function  of  wavefront  correlation  length  {By  =  (10~®,0) 
rad,  /a  =  0.5  m~*,  /sa  =  5  m“*,  zi,  =  5000  m) 

3.7.4  Guide  Star  Separation.  Figure  3.8  shows  that  estimator  performance  improves  as 
the  reference  stars  are  spread  more  widely  apart.  Wider  reference  star  separation  means  that 
the  optimum  weights  must  shift  the  measurements  a  greater  distance  in  order  to  correlate  the 
contribution  from  the  turbulent  layer  of  interest  in  each  wavefront  measurement.  Such  large  spatial 
shifts  also  have  the  beneficial  effect  of  decorrelating  the  contributions  from  the  other  layers  more 
than  small  spatial  shifts.  This  improvement  is  also  asymptotic;  after  a  point,  the  additional  benefit 
of  further  decorrelation  is  negligible  since  after  a  point  the  shift  of  the  competing  layers  is  so  large 
as  to  be  on  the  skirts  of  the  correlation  functions.  The  finite  size  of  the  telescope  aperture,  not  yet 
taken  into  account,  would  restrict  guide  star  separation  in  a  practical  system. 
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Figure  3.8.  Estimator  performance  as  function  of  reference  star  angular  separation  (pi,  =  0.845 
fa.  =  0.5  m~\  fsa  =  5  21,  =  5000  m) 

3. 8  Example 

The  results  of  the  previous  section  are  now  applied  to  the  analysis  of  the  performance  of  a 
multiconjugate  adaptive  optics  system.  For  this  example,  it  is  assumed  that  the  seeing  conditions 
are  characterized  by  Kolmogorov  turbulence  with  tq  =  20  cm.  The  Kolmogorov  structure  function 
is  approximated  with  a  Gaussian  structure  function,  as  shown  in  Figure  3.2,  and  it  is  found  that 
<7  =  7.75  rad  and  p  =  0.845  m  are  good  choices  of  parameters. 

It  is  assumed  that  the  turbulence  is  characterized  by  two  identical  layers,  one  at  altitude 
21,  =  5000  m  and  the  other  at  21,  =  10000  m,  so  the  final  result  of  the  last  Section  (3.45)  can  be 
used  in  the  calculations  for  this  example.  Using  formula  (3. 15),  it  is  seen  that  ri  =r2~30cm.  Also, 
the  variance  of  the  phase  divides  equally  between  the  two  turbulent  layers,  hence  =  (t^,^  =  5.5 
rad. 

Consistent  with  assumption  (3.32)  the  guide  stars  are  positioned  symmetrically,  with  61  = 
—02  =  (10”®,0)  rad.  For  an  effective  aperture  size  of  I  m,  and  assuming  tilt  accounts  for  up  to 
1/2  cycle  per  effective  telescope  aperture,  /«  =  0.5  m“*  is  obtained.  This  leads  to  a  value  of 
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of  ~  1.9  rad,  implying  that  ~  88%  of  the  power  in  the  random  process  is  contained  in  the 
tilt  component,  which  is  about  right  for  Kolmogorov  turbulence.  For  subapertures  that  are  10  cm 
square,  equation  (3.37)  may  be  used  to  approximate /sa  as  5  m“’.  Using  these  parameters  in  (3.45) 
the  plot  shown  in  Figure  3.9  is  obtained,  which  gives  the  error  in  estimating  the  contribution  of 
one  of  the  two  turbulent  layers  to  the  total  wavefront. 


0.5  0.45  0.4  0,35  0.3  0.25  0.2  0.15  0.1  0.05  0 

<Tn  (rad) 

Figure  3.9.  Estimator  performance  for  —  5.5  rad,  pi,  =  0.845  m,  0\  =  (10~®,0)  rad,  /sa  =  5 
m~*,  /a  =  0.5  m“^  ii,  =  5000  m 

A  typical  goal  for  an  adaptive  optics  system  is  to  remove  the  rms  wavefront  phase  distortion 
to  <  0.5  rad  (19).  The  error  in  the  estimated  wavefront  component  will  be  one  of  several  factors 
contributing  to  the  inability  of  the  adaptive  optics  system  to  perfectly  remove  phase  distortion. 
Some  of  the  possible  factors  for  a  two-DM  MCAO  system  include:  1)  wavefront  estimation  error 
for  two  wavefront  components,  2)  overall  tilt  removal  error,  and  3)  error  due  to  the  change  in  the 
state  of  the  atmosphere  during  processing  time  (time  delay  error).  In  order  to  achieve  a  system 
design  goal  of  0.5  rad  rms  residual  phase  error,  the  variance  of  each  of  the  4  aforementioned  error 
sources  must  be  a  factor  of  4  less  than  the  goal.  This  consideration  leads  to  a  desired  wavefront 
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component  estimation  error  of  ~  0.25  rad  rms.  From  Figure  3.9,  it  is  seen  that  this  level  of 
performance  requires  the  wavefront  measurement  error  to  be  <  0.08  rad  rms. 

Wavefront  sensor  measurement  noise  power  is  inversely  related  to  the  number  of  photons 
entering  the  subaperture  per  integration  time,  ATpj,: 


(rad) 


(3.47) 


where  is  a  constant  which  depends  on  wavefront  sensor  characteristics.  In  their  recent  analysis 
of  a  laser  guide  star  adaptive  optical  telescope,  Gardner  et  al.  used  —  1.30  (22).  From  the 
above  formula  and  Figure  3.9,  it  is  seen  that  264  photons  per  subaperture  per  wavefront  sensor 
integration  time  would  be  necessary  to  estimate  to  the  required  accuracy.  This  flux  level  is 

about  14  times  that  specified  by  Gardner  et  al.  for  a  single-DM  adaptive  optics  system  to  achieve 
a  rms  wavefront  error  of  ~  0.57  rad  when  ro  =  20  cm. 


3.9  Summar-y 

In  this  chapter,  a  frequency  domain  approach  was  taken  to  solve  the  problem  of  estimating 
the  contribution  of  different  parts  of  the  atmosphere  to  total  wavefront  aberration.  This  approach 
provides  some  insight  into  the  problem,  but  the  approximations  involved  make  the  quantitative 
results  somewhat  questionable.  The  next  chapter  describes  a  spatial  domain  analysis  which  involves 
considerably  fewer  approximations. 
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IV.  Spatial  Domain  Analysis 


4-1  Introduction 

In  this  chapter,  a  statistical  approach  is  used  determine  the  proper  commands  for  each  DM. 
The  approach  is  based  on  Waliner's  general  adaptive  optics  system  analysis  method  (62),  which  is 
extended  to  include  systems  with  multiple  guide  stars  and  DMs.  In  Section  4.2,  the  atmosphere 
model  used  in  this  chapter  is  described,  while  Section  4.3  outlines  the  adaptive  optics  system  model. 
The  main  analytical  results  are  in  Section  4.4,  in  which  expressions  for  two  types  of  mean  square 
residual  wavefront  phase  error  are  derived;  1)  averaged  over  the  surface  of  each  DM  and  2)  averaged 
over  the  area  of  an  object  wavefront.  The  DM  control  commands  are  chosen  to  minimize  the  first 
type  of  error,  while  the  second  type  of  error  is  the  criterion  used  to  evaluate  performance  of  the 
MCAO  system.  In  Section  4.5,  specific  system  and  atmosphere  characteristics  are  assumed  in  order 
to  make  numerical  computations.  In  Section  4.6,  the  results  of  such  computations  are  shown  and 
a  summary  is  in  Section  4.7. 

4-2  .Atmospheric  Model 

As  discussed  in  previous  chapters,  several  factors  motivate  the  consideration  of  a  layered 
atmosphere  model  for  an  analysis  of  MCAO.  One  reason  is  that  experimental  evidence  indicates  that 
the  optical  turbulence  in  the  atmosphere  is  often  concentrated  in  a  small  number  of  layers  (52,  61). 
However,  any  turbulence  distribution  could  be  approximated  by  using  a  sufficient  number  of  layers. 
Since  an  MCAO  system  approximates  a  layered  atmospheric  structure,  the  use  of  such  a  model  is 
convenient  for  analysis  purposes. 

To  build  a  layered  atmospheric  model,  it  is  first  assumed  that  the  refractive  index  structure 
constant,  C^{z)  (m“^/^),  is  known  as  a  function  of  altitude,  z.  This  information  could  come  from 
a  remote  sensing  instrument.  This  continuous  profile  is  discretized  into  I  thin  turbulent  layers  at 
altitudes  z\^  (m).  Each  layer  is  modeled  with  a  random  phase  screen,  rp\,{x)  (rad).  The  phase 
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screen  V’i,(^)  describes  how  the  phase  of  an  optical  wave  propagating  through  the  horizontal  plane 
z  —  zi,  is  altered  as  a  two-dimensional  function  of  position  x  (m).  This  function  is  assumed  to  be 
a  sample  function  of  a  random  process  characterized  by  a  structure  function  (rad“); 

Di(x,£')  =  ([V-i.C?)  -  ti('i.(*')]“)  (4.1) 

where  the  angle  brackets  indicate  an  ensemble  average.  The  turbulent  layers  are  assumed  to  be 
statistically  independent  (26,  45). 

As  in  Chapter  III,  geometrical  optics  propagation  is  assumed  between  the  turbulent  layers. 
This  layered  turbulence  model  is  now  used  to  write  an  expression  for  the  total  phase  aberration 
on  an  optical  wave  emanating  from  a  point  source  at  an  arbitrary  position.  Figure  4.1  shows 
a  ray  traced  from  the  source  through  turbulent  layer  V’!,(^)  to  point  x  in  the  aperture  plane. 
The  location  of  the  source  is  given  by  the  position  vector  r*  =■  (zs^s^i ^s)  (using  a  small 
angle  approximation).  The  position  vector  p)  denotes  the  point  where  the  ray  crosses  the  plane  of 
turbulent  layer  i.  From  the  geometry,  p,  is  found  to  be 

Pi  =  ^+  {[(rs-*)  *]i+[(rs-i)  p]y}  (4.2) 

where  x  and  y  are  unit  vectors  in  the  x  and  y  directions.  This  expression  can  be  simplified  to 


Using  this  result,  along  with  the  geometrical  optics  assumption,  the  total  wavefront  in  the 
aperture  plane,  from  a  source  located  at  position  9s  (rad)  and  altitude  z*  (m)  is  the  sum  of 
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the  1  turbulent  layer  (phase  screen)  contributions. 


/ 

V’l. 

1=1 

where  is  the  contribution  of  turbulent  layer  i.  In  this  model,  the  spherical  surface  of  the 

wavefront  from  a  finite  altitude  source  is  ignored.  In  practice,  it  must  be  accurately  determined  (36). 

4  S  System  Model 

The  adaptive  optics  system  model  described  in  this  section  is  based  on  the  model  presented 
by  Wallner  in  (62).  His  model  was  also  used  by  Welsh  and  Gardner  to  study  the  single-DM,  open 
loop  adaptive  telescope  schematically  diagrammed  in  Figure  4.2.  In  this  illustration,  a  reference 
wavefront  propagates  through  a  region  of  optical  turbulence  and  arrives  at  the  telescope  aperture 
as  an  aberrated  wavefront  described  by  the  phase  perturbation  function,  t/'t(x)  (rad).  A  WFS 
in  a  conjugate  plane  to  the  aperture  measures  the  average  wavefront  gradient  over  an  array  of 
subapertures.  The  control  matrix,  M,  linearly  combines  the  slope  measurements,  s,  to  obtain  a 
set  of  command  voltages,  c,  to  be  applied  to  the  actuators  on  the  DM.  Typically,  the  DM  is  also 
conjugated  to  the  aperture  plane.  The  matrix  M  is  chosen  to  minimize  some  functional  of  the 
ifsidual  wavefront  error,  e(x)  (rad). 

In  this  chapter,  this  model  is  generalized  so  that  adaptive  telescopes  with  K  DMs  can  be 
studied.  As  shown  in  Figure  4.3  {K  =  2),  the  light  reflects  off  each  DM  in  series.  In  this  approach, 
the  DMs  are  conjugated  to  planes  at  a  range  of  distances  from  the  aperture.  Appropriate  command 
vectors  c*  (or,  equivalently,  control  matrices  M*)  are  chosen  so  that  the  DMs  replicate,  as  closely  as 
possible,  the  three  dimensional  turbulence  structure.  Although  not  shown  in  Figure  4. ,3  for  clarity, 
this  multipie-DM  system  still  ha'-  a  single  WFS  conjugated  to  the  aperture  plane. 


(-t) 


+  21, 


(4.4) 
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Figure  4.2.  Single-DM,  open-loo|)  adaptive  optical  telescope 


4-3.1  Guide  stars  Multiple  guide  stars  are  required  for  an  MCAO  telescope  for  two  reasons; 
1)  to  completely  sample  the  turbulence  structure  throughout  the  FOV  to  be  compensated  (57), 
and  2)  to  combine  multiple  measurements  to  improve  the  quality  of  the  estimate  of  the  turbulent 
layer  contributions  (5).  With  A/  guide  stars  positioned  to  project  through  the  turbulence  region  at 
different  angles,  the  adaptive  optical  system  will  collect  an  ensemble  of  M  wavefront  measurements. 
In  Section  4.4,  it  is  shown  that  the  spatial  diversity  in  the  measured  wavefronts,  along  with  the 
geometrical  optics  assumption,  will  enable  determination  of  the  DM  control  matrices,  Mt,  using 
standard  minimum-variance  techniques. 

Since  very  few  natural  stars  are  bright  enough  to  be  used  for  WFS  measurements  at  visible 
wavelengths,  the  guide  star  array  for  a  multiple-guide-star  system  must  be  artificially  generated.  As 
discussed  in  Chapter  II,  laser  beams  have  already  been  used  to  make  practical  WFS  light  sources 
using  Rayleigh  scattering  from  air  molecules  in  the  troposphere  (20)  and  stimulation  of  resonance 
fluorescence  in  a  naturally  occurring  layer  of  sodium  atoms  in  the  mesosphere  (35).  Overall  tilt 
information  cannot  be  obtained  from  the  artificial  guide  stars,  since  the  laser  projection  system 
also  requires  tilt  compensation.  Rather,  natural  guide  stars  are  used  to  provide  a  reference  for  tilt- 
correction  with  a  two-degree-of-freedom,  or  “tip-tilt”  mirror.  Since  the  full  aperture  may  be  used 
for  overall  tilt  sensing,  the  brightness  requirements  are  not  as  stringent  for  the  tilt  compensation 
guide  star  as  for  the  WFS  reference.  However,  to  obtain  the  full  benefits  of  high-order  correction, 
commensurately  accurate  tilt  compensation  is  required.  For  this  reason,  the  brightness  of  the  tilt 
guide  star  and  its  proximity  to  the  object  are  limiting  factors  on  the  performance  of  any  laser  guide 
star  adaptive  optics  system,  including  an  MCAO  system  (16,  51,  54), 

The  position  of  each  of  the  M  reference  stars  is  described  by  a  two-dimensional  vector  Om  = 
(^m, ,  ^m»)-  The  altitude  of  the  guide  stars  is  designated  Zg.  Using  equation  (4.4),  an  expression  for 
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the  total  wavefront  phase  perturbation  in  the  aperture  plane  due  to  guide  star  m  can  be  written; 


[^(l-  +-i/n 


(4.5) 


One  way  to  cissociate  the  measured  wavefronts  with  their  reference  sources  is  by  time  multi¬ 
plexing  the  guide  stars.  However,  it  is  required  that  all  M  phase  perturbation  measurements  be 
made  within  the  atmosphere’s  correlation  time  (27).  This  requirement  clearly  limits  the  number  of 
measurements.  Researchers  have  already  demonstrated  adaptive  optics  using  sequential  wavefront 
measurements  from  two  artificial  guide  stars  (47).  In  this  experiment,  the  guide  stars  were  created 
via  Rayleigh  backscattered  laser  light.  It  is  speculated  that  wavelength  multiplexing  could  be  an 
other  possible  means  of  distinguishing  the  measurements  when  using  Rayleigh  guide  stars.  Sodium 
guide  stars  have  the  advantage  of  having  a  much  higher  altitude,  thus  reducing  the  effects  of  focus 
anisoplanatism  (22).  However,  since  all  sodium  guide  stars  have  the  same  wavelength,  they  cannot 
be  distinguished  chromatically. 

4-3.2  Wavefront  sensor  Recall  that  the  proposed  MCAO  system,  although  having  several 
DMs,  has  only  one  WFS  optically  conjugated  to  the  telescope  pupil.  In  this  section,  the  WFS  is 
modeled  after  a  Hartmann  sensor.  This  type  of  WFS  outputs  the  average  wavefront  gradient  over 
a  grid  of  subapertures.  Before  modeling  the  output  of  the  WFS,  the  aperture-average  phase  is 
removed  from  the  mth  reference  wavefront,  ^t„(*)  to  obtain  a  piston-removed  wavefront, 


¥’t„(*)  =  V’t„(x)-t^t„ 


(4.6) 


where  the  piston  mode  of  the  wavefront  from  the  mth  guide  star,  i  is  given  by 


=  J  d-xW,{x)^tJx) 


(4.7) 
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and  Wa,(x)  (m  is  the  weighting  function  of  the  telescope  aperture,  which  for  convenience  is 
normalized  such  that 

Jd^xW^(x)^\  (4.8) 

Throughout  this  chapter,  the  notation  tp  indicates  a  wavefront  with  piston  removed  over  the  area 
of  the  telescope  aperture. 

To  model  the  WFS,  the  telescope  aperture  is  divided  into  N/2  subapertures.  Assuming  small 
wavefront  deviation  over  the  subaperture,  the  WFS  output  can  be  approximated  as  the  average 
wavefront  gradient  over  each  subaperture  (62).  Thus,  the  wavefront  sensor  output  consists  of  an 
A^-dimensional  vector  of  slope  values — the  slope  in  the  x  and  y  directions  over  each  subaperture. 
The  nth  slope  measurement  from  the  mth  guide  star  Smn  (rad/m),  is  described  by  (62) 

Smn  =  -  j  [VW„(£)  •d]<^t„(^)  ~  +  Omn  (4  9) 


where 

Wn{x)  (m”^)  is  the  weighting  function  of  the  subaperture  used  for  slope  measurement  n, 
Pi„{x)  (rad)  is  the  piston- removed  wavefront  from  guide  star  m, 
d  is  the  direction  of  sensitivity  of  the  nth  slope  sensor  (x  for  odd  n  and  y  for  even  n), 
is  the  overall  tilt  in  the  d  direction  during  upward  propagation, 

Omn  (rad/m)  is  the  noise  in  the  measurement, 
and  V  is  the  gradient  operator. 

As  shown  in  Appendix  B,  the  first  term  of  equation  (4.9)  is  obtained  by  integrating  by  parts 
an  expression  for  the  aperture  averaged  wavefront  gradient.  It  represents  the  true  value  of  the 
average  slope  of  <pt„(x)  over  the  subaperture  Wn(x)  while  the  last  term  represents  measurement 
noise  effects.  The  second  term  of  equation  (4.9)  desc'rves  a  more  detailed  discussion.  The  overall  tilt 
term,  Um, ,  is  modeled  as  the  best  fit  plane  to  the  laser  beam  wavefront  after  upward  propagation  to 
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the  scattering  layer,  given  by  +  UmyH-  This  term  is  present  because,  due  to  optical  reciprocity, 
the  overall  tilt  experienced  by  the  laser  beam  on  the  way  up  partially  cancels  that  experienced 
by  the  guide  star  wavefront  on  the  downward  propagation.  The  amount  of  cancellation  depends 
on  the  size  of  the  leiser  projection  aperture  relative  to  the  pupil.  It  is  important  to  include  this 
“blindness”  to  overall  tilt  to  obtain  accurate  results  from  this  analysis.  Not  including  this  term  is 
equivalent  to  assuming  that  overall  tilt  information  is  available  in  the  direction  of  each  guide  star, 
which  is  unrealistic.  Letting  a  equal  x  or  y,  as  appropriate,  the  tilt  coefficient  is  found  to  be 

/  d^xWp{x)a>fix,Ax) 

-  (4.10) 

/  d^xWp{x)a~ 


where  Wp{x)  (m  -)  is  the  weighting  function  of  the  laser  projection  aperture. 


Following  the  usual  practice,  the  slope  measurement  noise,  Omn  is  modeled  as  zero  mean 
and  uncorrelated  with  the  turbulence  (62).  Also,  the  noise  is  uncorrelated  from  subaperture  to 
subaperture  and  measurement  to  measurement.  Furthermore,  the  noise  added  to  the  x-directed 
slope  is  uncorrelated  with  that  added  to  the  y-directed  slope,  even  when  measured  in  the  same 
subaperture.  Mathematically, 


(4.11) 


(«mnt(’l.(*  ))  =0 


(4.12) 


where  (T^  ((rad/m)^)  is  the  variance  of  the  slope  measurement  noise  and  bqq'  is  the  Kronecker  delta. 


4-3.3  Deformable  mirrors  The  mathematical  model  for  the  /fcth  DM  consists  of  a  collection 
of  Jk  two-dimensional  actuator  influence  functions  r,t(x  )  (m).  Physically,  these  functions  represent 
the  shape  of  the  mirror  surface  when  a  unit  command  voltage  is  applied  to  the  jth  actuator  on  the 
kth  DM,  with  the  other  actuators  held  at  their  quiescent  position.  The  spatial  extent  of  DM  k  is 
indicated  by  a  weighting  function,  which  is  normalized  in  the  same  manner  as  the 


4-10 


pupil  weighting  function,  H'a(j  ).  Unlike  the  case  of  the  single-DM  adaptive  telescope  studied  by 
Welsh  and  Gardner  (63),  the  DM  weighting  function  is  not,  in  general,  the  same  as  the  aperture 
weighting  function.  As  shown  in  Figure  4.4,  to  widen  the  corrected  FOV  the  DM  must  compensate 
an  area  larger  than  the  aperture.  The  extent  of  (£)  is  determined  by  the  altitude  of  the  plane 
conjugate  to  the  Jfcth  mirror,  (m),  and  the  FOV  of  the  telescope,  0  (rad). 

4-4  Analysts 

The  previous  sections  defined  a  layered  atmospheric  model  and  generalized  W^allner’s  adaptive 
optics  system  models  to  allow  for  an  arbitrary  number  of  guide  stars  and  DMs.  This  section  presents 
a  method  which  uses  the  ensemble  of  wavefront  measurements  to  obtain  K  sets  of  DM  command 
voltages.  The  intent  is  to  construct  a  three-dimensional  correction  for  atmospheric  distortion  in 
the  optical  train  of  the  telescope.  The  general  approach  is  to  divide  the  atmosphere  into  K  slabs, 
corresponding  to  the  available  K  DMs.  The  contribution  of  the  ikth  slab  to  the  total  wavefront  is 
called  the  Jkth  wavefront  “component”  and  designated  The  surface  of  DM  k  is  figured  to 

compensate  for  the  wavefront  )«  where  <i>^^  is  the  piston-  and  tilt-reraoved  phase 

=  J  +  (4.13) 

where  is  defined  in  the  same  way  as  Um,  (equation  (4.10))  except  the  DM  weighting  function 
is  used.  (It  is  desirable  to  place  a  piston-  and  tilt-removed  wavefront  on  the  surface  of  the  DM  to 
help  ensure  that  the  actuators  are  not  driven  beyond  the  limits  of  their  dynamic  range.  Recall  that 
overall  tilt  is  compensated  by  a  separate  correction  system,  not  by  the  DMs.)  In  analyzing  the 
performance  of  an  MCAO  system,  two  types  of  errors  are  of  interest.  The  first  type  is  the  mean 
square  difference  between  the  desired  and  actual  surfaces  of  the  itth  DM.  Then  an  optimum  matrix, 
m;,  is  chosen  to  minimize  this  difference  using  the  phase  screen  statistics.  This  reconstruction 
method  is  known  as  the  minimum  variance  approach.  To  evaluate  the  performance  of  the  MCAO 
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Figure  4.4.  Size  of  DM  k  weighting  function. 


system,  the  second  type  of  error  is  computed:  the  average  residual  phcuse  error  in  the  object 
wavefront  after  compensation  by  all  K  DMs  and  the  tip-tilt  mirror.  First,  consider  the  error  in  the 
reconstructed  wavefront  on  the  individual  DMs. 

4-4-J  Error  in  the  reconstructed  wavefront  component  Recall  that  the  goal  is  to  have  each 
DM  in  the  MCAO  system  compensate  for  the  phase  distortion  effects  for  a  certain  portion  of 
the  atmosphere.  That  contribution,  or  wavefront  component,  must  be  reconstructed  using  the 
information  in  the  slope  measurements  from  M  guide  stars  and  a  prion  knowledge  of  phase  and 
noise  statistics.  The  analysis  in  this  section  results  in  expression  for  the  reconstructed  wavefront 
component  placed  on  the  Fth  DM.  As  discussed  previously,  each  wavefront  measurement  consists  of 
an  A-dimensional  vector  of  subaperture  slopes,  s^,  collected  in  the  aperture  plane  using  guide  star 
m.  If  there  are  M  guide  stars,  then  altogether  there  are  MN  slope  measurements.  The  command 
voltage  Cjic,  to  be  applied  to  actuator  j  on  DM  k,  is  computed  from  a  linear  combination  of  these 
slope  measurements: 

M  N 

hdjkmn^mn 

m=l  n=l 

where  Mjkmn  is  the  weight  of  the  nth  slope  measurement  from  the  mth  guide  star  in  the  jtb 
actuator  command  for  DM  k.  In  matrix  notation, 

Cfc=Mts  (4.15) 

where  the  vector  c*  has  the  individual  Cjt  as  its  elements  and  the  vector  s  contains  all  MN  slope 
measurements.  The  matrix  Mt  is  a  V*  x  MN  reconstruction  matrix  for  DM  k. 

After  figuring,  the  surface  of  DM  k  may  be  mathematically  described  by  a  linear  combination 
of  actuator  influence  functions  rjt(x).  With  the  actuator  commands  given  by  equation  (4.15), 
the  surface  of  the  DM  forms  an  estimate  of  the  wavefront  component  Designating  this 
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estimated  wavefront,  it  may  be  written  as 


=  MkSTkix)  (4-16) 

where  rt(x  )  is  a  J* -dimensional  vector  of  actuator  influence  functions  for  DM  k. 

The  error  in  the  reconstructed  wavefront  component  placed  on  the  A:th  DM,  £i(r),  may  be 
written 

ik(x)  =  ^d^(x)-  (p^Jx)  (4.17) 

The  mean  square  error  averaged  over  the  surface  of  the  i;th  DM  is 

(f*)  =  J  ^ 

=  J  d^xW^^{x)(^^l^(£)^+  J  d^xWdA^){<^l^{x)) 

-2  J  d^xWi^{x)(^4^ix)4>dAx))  (4-18) 

where  W^i^(x)  is  the  aperture  weighting  function  for  DM  k.  Implicitly,  (e^)  is  a  function  of  system 
parameters  such  as  the  number  and  positions  of  the  guide  stars,  DM  actuator  density,  etc. 

By  substituting  equation  (4.16)  in  equation  (4.18),  (e|)  can  be  computed  in  terms  of  phase 
correlations  that  can  be  computed  using  Kolmogorov  statistics.  The  first  term  of  equation  (4.18) 
is  the  average  variance  of  the  /tth  reconstructed  wavefront  component; 

J d^xWi^(x)  =  tr  [MjRiMtS,,]  (4.19) 

where  tr[  ]  indicates  the  trace  of  the  matrix,  S,,  is  the  slope  measurement  covariance  matrix: 

E„  =  (ss"^)  (4.20) 
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and  R*  is  the  actuator  correlation  matrix  of  the  Jtth  DM: 

R*  =  y  )rjt(f  )rj(f )  (4.21) 

The  second  term  of  equation  (4.18)  is  the  mean  square  uncorrected  phase  error  of  the  A:th  wavefront 
component,  (fo^): 

(foj  =  J  d-xW^^{x){<f>l^(x))  (4.22) 

Finally,  the  third  term  of  equation  (4.18)  can  be  written 

J  d^xWi^(x)(^^dA^)<i>^^(x)'j  =  tr  [aJm*]  (4.23) 

where  the  matrix  A*  is  given  by 

At  =  J  dPxWdA^)^k(x){s^<j>dA^))  (4-24) 

The  final  result  for  the  mean  square  residual  phase  error  in  the  reconstructed  wavefront 

component  on  the  ibth  DM  is 

{4)  =  «)  +tr  [MjRtM/tS,,]  -  2tr  [A^’Mi]  (4.25) 

Note  that  detailed  derivations  of  the  expressions  in  this  section  are  in  Appendix  B. 

4-4-^  Mintmum  variance  reconstrucUon  method  One  could  suggest  a  number  of  ways  to 
derive  control  matrices  for  an  MCAO  system.  The  method  used  in  this  chapter  is  to  independently 
minimize  the  mean  square  residual  phase  error  averaged  over  the  surface  of  each  DM.  In  other 
words,  equation  (4.25)  is  minimized  with  respect  to  the  control  matrix  M*  to  obtain  an  optimum 
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control  matrix  for  each  DM.  As  shown  in  Appendix  B,  this  minimization  results  in 

M;  =  (4.2G) 

which  is  of  exactly  the  same  form  as  Wallner’s  optimum  matrix  for  a  single-DM  system  (62). 
Wallner’s  discussion  of  the  existence  conditions  for  the  matrix  inverses  in  equation  (4.26)  also 
applies  here. 

Substitution  of  equation  (4.26)  into  equation  (4.25)  yields  an  expression  for  the  minimum 
component  reconstruction  error: 


(4)nun  =  (4)  -  [SrMJ'Ri'A,]  (4.27) 

4-4-3  Residual  phase  error  in  the  compensated  object  wavefront  In  the  preceding  section 
the  mean-square  residual  error  in  each  wavefront  component  estimate  (i.e.  the  figured  surface  of 
each  DM)  was  derived.  This  quantity,  {el)  is  minimized  by  proper  selection  of  control  matrices, 
MJ.  The  main  interest,  however,  is  not  how  well  each  wavefront  component  is  estimated,  but 
how  well  an  object  wavefront  is  compensated  as  a  function  of  its  position  in  the  telescope’s  FOV. 
Theoretically,  if  the  individual  wavefront  components  ar?  well  estimated  the  object  wavefront  will 
be  well  compensated.  In  this  section,  the  compensated  object  wavefront  error  is  evaluated  by 
deriving  an  expression  for  the  mean-square  residual  phase  error  averaged  over  an  object  wavefront, 
after  compensation  by  all  K  DMs  and  the  tip-tilt  mirror.  For  calculation  purposes,  it  is  assumed 
that  the  object  is  a  far-field  point  source.  (This  is  not  to  imply  that  the  utility  of  MCAO  is  limited 
to  point  sources.  The  goal  of  the  MCAO  system  described  in  this  chapter  is  to  compensate  nearby 
or  extended  sources  as  well.  For  analysis  purposes,  it  is  simpler  to  decompose  the  object  field  into 
a  superposition  of  point  sources  and  evaluate  the  system  performance  at  selected  points.)  Using 
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equation  (4.4),  the  piston-removed  wavefront  from  the  object  can  be  written 


/ 

<Po(r.^o)  =  ~i.^o)  (4.28) 

1 

where  0o  is  the  angular  position  of  the  object.  This  wavefront  propagates  through  the  optical  train 
of  the  telescope,  reflecting  off  the  tip-tilt  mirror  and  the  K  DMs  in  series.  The  system  can  be 
analyzed  as  if  the  compensation  provided  by  each  DM  takes  place  in  its  corresponding  conjugate 
plane.  Thus,  the  residual  phase  error,  e(z,  ^o)  (rad),  of  the  compensated  wavefront  can  be  written; 

I  K 

e(x,S‘o)  =  z,,0o)  (4.29) 

i=l 

where  iPt(x)  is  the  overall  tilt  correction  using  a  natural  star  located  in  the  telescope’s  FOV^  Recall 
that  the  DMs  do  not  provide  overall  tilt  correction.  In  the  preceding  equation,  it  is  important  to 
note  that  the  average  phase  over  the  area  of  the  telescope  aperture  must  be  removed  from  the  DM 
terms  to  get  accurate  results.  This  is  because  an  arbitrary  aperture-sized  area  on  the  surface  of 
the  DM  may  have  a  non-zero  average  value  even  though  the  entin;  DM  surface  has  zero  piston.  Of 
interest  is  the  average  mean  ‘^  quare  residual  error  for  a  particular  observation  angle, 

(e'(^o))  =  J  d^iW,(x)  (^e^x,  C))  (4.30) 


Using  a  procedure  similar  to  that  outlined  in  the  previous  section,  the  following  expression  is 
obtained: 


K  K 


I  K 


{^H9o))  =  (eg(0o))  +  ££tr[M:'*'m*,(^o)M;,£„j-2j]£tr[AT(/o)M;j 

1=11=1 

-f  /  d^xWJ,x){^l{x))  -2^^  I  dPxW^{x)  (^ipr(x)>P[,{x  +  zi/o)) 

«=i 

K  . 

+2^  J  d^xWaix)Ml(s^r{x)^  ■rk(x  + (4.31) 
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The  hrst  term  in  the  preceding  equation  is  the  error  in  the  uncompensated  object  wavefront: 


(f5(^o))  -  J  (4.32) 

The  matrix  R*:*'(^o)  is  of  dimension  Jk  x  J*-  and  is  given  by 

R*i'(^o)  =  J  d-xWi,{x}^rk[x  +  ZdJo)Tj,{x  +  Zd^,0o)  -  fk(0o)rJ,{Oo)  (4.33) 

where 

^^(0o)  =  j  d-xW^(x)rk(^+ZdJo)  (4.34) 

The  second  term  in  equation  (4.33)  results  from  removing  the  average  phase  from  an  aperture-sized 
area  on  the  surface  of  the  DM.  The  Aji(^o).  (dimension  Jt  x  MN)  matrix  is  defined  as 

A.ik(0o)  =  J  d'^j^W3,ix)rk{x  + ZiJo)(s'^<fi\Sx  + (4.35) 

Having  derived  an  expression  foi  the  object  wavefront  error  in  terms  of  phase  correlations 
and  system  parameters,  it  is  now  possible  to  make  numerical  cemputations  in  order  to  evaluate 
system  performance.  No-^  that  Appendix  B  contains  the  detailed  derivations  of  the  results  of  this 
section. 

4-5  Numerical  Computaiw.tS 

In  the  preceding  sections,  a  general  framework  for  analyzing  a  multiple-DM,  multiple-guide- 
star  adaptive  telescope  was  developed.  The  analysis  is  not  completely  general  in  that  a  particular 
method  for  obtaining  control  matrices  was  chosen  and  the  amplitude  effects  of  atmospheric  tur¬ 
bulence  were  neglected.  However,  specifics  of  the  telescope  hardware  and  configuration,  as  well 
as  noise  and  phase  statistics,  have  yet  to  be  specified.  In  this  section,  a  particular  telescope  is 


selected  and  commonly  accepted  turbulence  and  noise  statistics  are  used  to  perform  numerical 
computations. 


4-5.1  Example  system  To  simplify  the  computations,  the  telescope  is  specified  to  have  a 
square  aperture  of  dimension  D  (m)  divided  into  a  rectangular  grid  of  square  subapertures  of 
dimension  d  (m)  where  N  =  2{Dfd}^  is  the  number  of  slope  measurements.  The  WFS  measures 
the  wavefront  gradient  averaged  over  each  of  these  subapertures.  Figure  4.5  illustrates  the  WFS 
grid  for  the  case  of  D/d  =  6.  The  subaperture  size  is  set  equal  to  the  atmospheric  coherence 
diameter,  ro  (17).  Similarly,  the  DMs  are  square  and  the  DM  actuators  are  arranged  in  a  square 
grid,  with  the  interactuator  spacing  on  all  mirrors  equal  to  tq.  All  the  actuator  response  functions 
are  Gaussian-shaped: 


rjk{x,y)  oc  exp 


-  [(^  -  Xjkf  +  (y  -  Vjkf 


Pm 


(4.36) 


where  xjk  and  yjk  are  the  actuator  coordinates  in  the  DM  k  plane.  The  mirror  influence  radius, 
Pm,  is  also  set  equal  to  ro-  Two  different  guide  star  arrangements  are  examined:  1)  a  single  guide 
star  located  on  the  optical  axis  and  2)  4  guide  stars  positioned  symmetrically.  The  4-guide-star 
constellation  is  illustrated  in  Figure  4.6. 


All  guide  stars  are  located  in  the  plane  z  =  The  coordinates  of  the  guide  star  positions 
in  this  plane  are 

^8m  ~  (4-37) 

“  ^g^m,  (4.38) 


To  generalize  the  results,  the  guide  star  positions  are  normalized  by  the  subaperture  dimension  d 
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Figure  4.5.  Square  telescope  aperture  divided  into  Af/2 


— 

Guide  star  3 


Field  of  view 


- ^ 

Guide  star  2 


Figure  4.6.  Telescope  aperture  and  FOV  projected  onto  the  guide  star  plane  where  a  constellation 
of  4  guide  stars  has  been  projected. 
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to  obtain  the  dimensionless  quantities  : 


- 


d 


(4.39) 

(4.40) 


In  the  case  of  the  4-guide-star  array,  the  guide  stars  are  located  at  the  points  (xg,yg),  (ig,— yg), 
(— ig,— t/g),  (— ig.iig).  A  similar  set  of  normalized  coordinates,  (inyr),  is  defined  for  the  natural 
tilt  reference  star.  Finally,  the  normalized  object  coordinates  are 


X 


O 


Vo 


d 


(4.41) 

(4.42) 


where  and  Oq^  (rad)  are  the  x  and  y  components  of  the  object  angle  vector.  As  illustrated  in 
Figure  4.6,  the  mean  square  residual  phase  error  of  the  compensate  object  wavefront  is  evaluated 
as  a  function  of  position  along  a  diagonal  line  from  the  optical  axis  toward  one  of  the  corners.  The 
maximum  allowed  object  coordinates  are  determined  by  the  FOV  of  the  telescope  which  is  specified 
by  the  angle  /?  (see  Figure  4.7).  These  maximum  coordinates  are  designated  (x0,yp): 

^  (4-43) 


For  the  4-guide-star  array  to  probe  the  entire  FOV,  the  normalized  guide  star  locations  need  to  be 


D  . 

=  55  +  *^ 

(4.44) 

D  . 
yg  —  2<f 

(4.45) 
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The  conjugate  plane  to  DM  k  is  defined  to  be  at  the  altitude  z  =  (m).  This  altitude  is 


normalized  by  the  guide  star  altitude  to  obtain  the  dimensionless  quantity  ; 


As  illustrated  in  Figure  4.7,  the  size  of  DM  k,  Dt  (m),  is  given  by 


(4.46) 


Dk=D  +  2dId^X0  (4.47) 

where  small  angle  approximations  have  been  used. 

The  final  system  specification  is  the  determination  of  the  wavefront  components,  As 

explained  earlier,  4>d^(x)  is  that  portion  of  the  total  wavefront  aberration  which  DM  k  is  intended 
to  compensate.  As  shown  in  Figure  4.8,  DM  k  is  assigned  the  wavefront  phase  contribution  of  a 
slab  of  atmosphere,  5* .  (In  this  dissertation,  the  problems  of  how  to  best  divide  the  atmosphere 
into  slabs  and  how  the  conjugate  planes  are  chosen  are  not  addressed.)  The  wavefront  component 
*8  defined  to  be  the  sum  of  the  turbulent  layers  ^i,(x)  contained  within  slab  5^: 

=  X] (4-48) 


4-5.2  Phase  siatisUcs  As  explained  previously,  the  ith  turbulent  layer  alters  the  pheise  of 
a  plane  wave  propagating  through  it  according  to  a  phase  perturbation  function  (rad), 

where  x  =  (x,  y)  (m)  is  a  position  vector  in  the  plane  of  the  layer.  For  computational  purposes, 
this  function  is  assumed  to  be  a  sample  function  of  a  random  process  which  is  described  by  the 
Kolmogorov  structure  function  I?i(x,  x* )  (rad^)  (64): 

Vi{x,j?)  =  ([^,.(x)  -  =  6.88  (4.49) 
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Figure  4.7.  Size  of  DM  k. 
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Figure  4.8.  Portion  of  the  atmosphere  assigned  to  DM  k. 


where  the  angle  brackets  indicate  an  ensemble  average.  The  parameter  »■,  (m)  is  the  turbulent 
layer’s  coherence  diameter.  The  r;  are  related  to  ru,  the  overall  atmospheric  coherence  diameter, 
by  the  formula  (5) 


^-5/3 


(4.50) 


i  =  l 

It  is  also  convenient  to  characterize  the  turbulent  layer  as  having  a  fraction,  qi,  of  the  total  turbu¬ 
lence  strength: 


qi  = 


(4.51) 


and  to  define  a  normalized  layer  altitude  £1,: 


H 


(4.52) 


4.5.3  Noise  statistics  Welsh  and  Gardner  quantified  the  slope  measurement  error  due  to 
photon  noise  associated  with  a  Hartmann-Shack  WFS  (63): 


<^a=  < 


0.747rr7 


d<ro 


rad/m 


(4.53) 


where  ./Vpj,  is  the  total  subaperture  photon  count  and  t]  is  a.  detector  efficiency  factor  which  is 
>  1.  In  the  computations  in  this  chapter,  rj  =  1.35  was  used.  Kane  et  al.  give  this  value  as  that 
associated  with  an  efficient  charge-coupled  device  array  preceded  by  an  image  intensifier  (40). 


4.6  Results 

In  this  section,  the  analysis  method  described  in  Section  4.4  and  the  assumptions  in  Section  4.5 
are  used  to  present  numerical  results  for  single-DM  and  multiple-DM  adaptive  optics  systems 
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imaging  through  a  2-layer  atmosphere.  The  2-layer  model  is  composed  of  a  low-altitude  turbulent 
layer  containing  90%  of  the  turbulence  strength  located  at  1%  of  the  guide  star  altitude,  and  an 
upper  layer  having  the  remaining  10%  of  the  turbulence  at  10%  of  rg.  The  MCAO  system  used  for 
these  computations  is  composed  of  a  WFS,  2  DMs  and  4  guide  stars.  The  WFS  is  a  6  x  6  grid  of 
square  subapertures,  with  subaperturc  dimension  d  =  tq.  For  all  computations,  a  photon  count  of 
A^ph  =  50  photons  per  subaperture  was  assumed.  The  4  guide  stars  are  arranged  in  the  constellation 
shown  in  Figure  4.6,  vcith  Xg  =  pg  =  22.5.  (This  value  was  chosen  because  it  corresponds  to  50 
^rad  when  d  =  0.2  m  and  Zg  =  90  km,  the  nominal  altitude  of  the  sodium  layer.)  The  tilt  reference 
star  is  located  at  the  origin.  A  side  view  of  the  geometry  is  depicted  in  Figure  4.9.  As  shown  in  the 
figure,  DM  1  must  cover  an  area  1.065  times  as  large  as  the  telescope  aperture,  while  DM  2  is  1.65 
times  the  size  of  the  pupil.  Due  to  the  fact  that  the  DM  areas  are  larger  than  the  aperture  area,  as 
well  as  the  finite  height  of  the  guide  stars,  each  guide  star  probes  only  a  fraction  of  each  turbulent 
layer.  Figures  4.10  and  4.11  illustrate  how  a  portion  of  each  layer  will  he  sampled  by  all  4  guide 
stars,  other  sections  will  contribute  to  2  guide  star  wavefronts,  and  some  areas  are  probed  by  only 
one  of  the  guide  stars.  In  the  case  of  layer  1,  each  guide  star  probes  86%  of  the  area  of  DM  1, 
with  74%  of  the  area  common  to  all  4  measurements  and  24%  common  vo  2  of  the  4  measurements. 
Figure  4,11  shows  that  the  larger  area  of  DM  2  is  not  probed  as  thoroughly  by  the  4-guide-star 
array;  only  1%  of  the  area  is  common  to  all  4  measurements  and  16%  common  to  2  measurements. 
Table  4.1  summarizes  the  atmospheric  and  system  parameters  used  in  the  computations  of  this 
section. 

Using  the  parameters  in  Table  4.1,  the  optimum  control  matrices  for  the  2-DM  system  were 
found  using  equation  (4.26).  Then  the  object  wavefront  error  was  evaluated  at  5  points  in  the  FOV 
along  the  diagonal  line  shown  in  Figure  4.6.  The  results  are  shown  in  Figure  4.12.  For  comparison, 
the  object  error  at  the  same  points  for  a  single-DM,  single-guide-star  system  is  also  plotted.  For 
this  “conventional”  adaptive  optical  telescope,  the  DM  is  optically  conjugated  to  the  pupil  and  the 
guide  star  is  located  on  the  optical  axis.  To  allow  a  fair  comparison  with  the  4-guide-star  system. 
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Tilt  guide  star 
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Figure  4.10.  Areas  of  turbulent  layer  1  probed  by  each  guide  star  in  4-guide-star  array.  The 
double-cross-hatched  area  is  probed  by  all  4  guide  stars,  the  single  cross-hatched 
area  is  probed  by  2  guide  stars,  while  the  small  non-cros.s-hatched  square  areas  in 
each  corner  are  probed  by  only  one  guide  star. 
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Figure  4.11.  Areas  of  layer  2  probed  by  each  guide  star  in  4-guide-star  array.  Ihe  double-cross- 
hatched  area  is  probed  by  all  4  guide  stars,  the  single  cross-hatched  area  is  probed 
by  2  guide  stars,  while  the  non-cross-hatched  square  areas  in  each  corner  are  probed 
only  one  guide  star. 
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Parameter 

Sytnbol 

Value 

Number  of  turbulent  layers 

/ 

2 

Normalized  altitude  of  turbulent  layer  1 

-1. 

0.01 

Fraction  of  turbulence  strength  in  lajer  1 

0.90 

Normalized  altitude  of  turbulent  layer  2 

-l2 

0.10 

Fraction  of  turbulence  strength  in  layer  2 

92 

0.10 

Ratio  of  aperture  to  subaperture  size 

D/d 

6.0 

Ratio  of  subaperture  to  seeing  cell  size 

d/ro 

1.0 

WFS  efficiency  factor 

V 

1.35 

Detected  photons  per  subaperture 

WEim 

50 

Number  of  DMs 

^91 

2 

Normalized  altitude  of  conjugate  plane  1 

0.01 

Normalized  altitude  of  conjugate  plane  2 

_ **^2 _ 

0.10 

Guide  star  positions 

WSMM 

22.5 

Maximum  object  coordaiates 

ip,  yp 

19.5 

Tilt  reference  star  position 

^TiVr 

0. 

Table  4.1.  Parameters  for  MCAO  problem. 


4  measurements  from  the  single  guide  star  were  combined  (which  reduces  the  measurement  noise 
power  by  a  factor  of  4).  F^r  both  systems,  overall  tilt  was  compensated  by  removing  the  best-fit 
plane  from  an  on-axis  natural  guide  star  from  the  object  wavefront  (errors  in  the  tilt  compensation 
system  were  not  modeled,  but  tilt  anisoplanatism  is  properly  accounted  for  since  the  tilt  reference 
star  and  object  point  are  not  necessarily  in  the  same  location).  The  figure  indicates  that  the  MCAO 
system  is  able  to  compensate  its  entire  FOV  to  an  rms  residual  phase  error  of  A/7.  However,  the 
single-DM  system  outperforms  the  MCAO  system  for  object  points  out  to  about  1/3  of  the  way  from 
the  optical  axis  to  the  corner  of  the  FOV.  The  single-DM  system  does  very  well  in  the  neighborhood 
of  the  optical  axis  since  its  single  guide  star  is  located  there.  In  effect,  the  single-DM  system  is 
designed  to  optimize  performance  along  a  single  observation  direction — in  the  direction  of  the  single 
reference.  By  comparison,  the  multiconjugate  system,  in  this  implementation,  does  not  optimize  for 
a  particular  observation  direction  but  attempts  to  compensate  the  entire  FOV  as  well  as  possible. 
As  a  result,  a  system  which  has  an  optimum  observation  direction  has  better  performance  in  that 
direction,  at  the  expense  of  worse  performance  in  other  directions. 
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Figure  4.12.  RMS  residual  phase  error  averaged  over  object  wavefront  vs.  object  position  for  2- 
DM  adaptive  telescope  with  4  guide  stars  compared  with  “conventional”  single-DM 
system. 
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Figure  4.13  illustrates  the  effect  of  laser  guide  star  tilt  blindness.  In  this  figure,  the  perfor¬ 
mance  of  the  2-DM,  4-laser-guide-star  system  is  compared  with  a  hypothetical  system  employing 
4  reference  sources  (e.g.  4  sufficiently  bright  natural  stars  in  just  the  right  locations)  that  also 
provide  overall  tilt  information.  This  hypothetical  system  does  not  use  a  separate  tip-tilt  mirror, 
but  corrects  the  tilt  component  using  the  DMs.  From  examination  of  the  figure,  we  see  that  tilt 
anisoplanatism  significantly  limits  the  performance  of  a  realistic  MCAO  system. 


Figure  4.13.  Performance  of  2-DM  system  with  multiple  WFS  reference  sources  that  provide 
overall  tilt  information  compared  with  system  using  laser  guide  stars  providing  only 
higher-order  phase  information. 


One  might  expect  that  we  could  improve  on  a  “conventional”  adaptive  telescope  design, 
by  conjugating  the  mirror  to  a  plane  closer  to  the  turbulence.  In  Figure  4.14,  three  cases  are 
compared  corresponding  to  the  DM  conjugated  to  1)  the  aperture  2)  the  strongest  turbulent  layer, 
and  3)  the  center  of  gravity  (CG)  of  the  turbulence  profile.  The  results  show  that,  for  this  2-layer 
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atmosphere,  slightly  better  performance  is  obtained  if  the  DM  is  conjugated  to  the  aperture  when 
the  FOV  criterion  is  average  rms  residual  phase  error  <  A/8.  However,  conjugating  the  mirror  to 
the  strongest  turbulent  layer  is  better  for  object  points  more  than  one-third  of  the  way  to  the  edge 
of  the  FOV.  In  this  case,  conjugating  the  DM  to  the  center  of  gravity  of  the  turbulence  profile  is 
the  least  effective  choice.  The  2-DM  system  appears  to  work  better  than  either  of  the  single-DM 
alternatives. 


Object  position,  z^ld 


Figure  4.14.  RMS  residual  phase  error  averaged  over  object  wavefront  vs.  object  position  for 
single-DM  adaptive  telescope  and  various  choices  of  DM  position,  compared  with 
2-DM  system.  For  the  single-DM  cases  with  the  DM  conjugated  to  the  strongest 
layer  and  the  turbulence  center  of  gravity,  the  4  guide  star  constellation  was  used. 


4-7  Summary 

Wallner’s  adaptive  optics  system  analysis  method  has  been  generalized  to  allow  for  an  arbi¬ 
trary  number  of  guide  stars,  DMs,  and  atmospheric  layers.  This  method  can  be  used  to  compute 
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analytical  performance  predictions  for  a  variety  of  adaptive  optics  problems.  The  chief  limita¬ 
tion  is  the  impracticality  of  computing  results  for  large  telescopes  due  to  excessive  computation 
time.  However,  the  technique  does  provide  further  insight  into  the  performance  of  various  adap¬ 
tive  telescope  system  configurations  imaging  through  different  atmospheres.  The  results  show  that 
multiple-DM  systems  do  hold  promise  for  widening  the  compensated  FOV,  provided  that  multiple 
wavefront  measurements  from  different  guide  star  locations  can  be  made  before  the  atmosphere 
changes  significantly.  Performance  of  an  MCAO  system  is  still  limited  by  tilt  anisoplanatism,  since 
tilt  information  is  not  obtainable  from  the  artificial  guide  stars. 
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V'.  Conclusions  and  Recommendations 


5.1  Major  Results 

The  major  results  of  this  research  effort  are  llie  following: 

1)  Development  of  an  analytical  method  for  estimating  the  contribution  of  a  portion  of  the 
turbulent  atmosphere  to  the  total  optical  wavefront  distortion  resulting  from  propagation 
through  the  entire  atmosphere. 

2)  Development  of  a  practical  method  of  controlling  an  arbitrary  number  of  DMs  in  an  adaptive 
optical  telescope  to  achieve  a  useful  level  of  phase  compensation  over  a  wide  FOV. 

3)  The  first  analytical  performance  prediction  for  a  multiple-DM,  multiple-laser-guide  star  tele¬ 
scope  imaging  through  a  layered  atmosphere  incorporating  the  integrated  effects  of  WFS 
noise;  angular,  focus,  and  tilt  anisoplanatism;  finite  aperture  sizes;  DM  influence  functions; 
and  laser  guide  star  tilt  blindness. 

4)  Quantitative  results  showing  that  multiple  DMs  and  multiple  artificial  guide  stars  can  be 
used  to  increase  the  compensated  FOV  of  an  adaptive  optical  telescope  under  good  observing 
conditions. 

5)  Quantitative  results  showing  the  effect  of  conjugating  a  single  DM  to  various  choices  for 
correction  planes,  as  compared  with  the  addition  of  a  second  DM. 

6)  Quantitative  results  showing  the  effect  of  laser  guide  star  tilt  blindness  on  the  performance 
of  a  wide-FOV  adaptive  optics  system. 

5.  S  Discussion 

One  way  to  quantify  the  compensated  FOV  for  an  adaptive  optical  telescope  is  to  specify 
the  area  over  which  a  particular  level  of  residual  phase  error  is  achieved.  Two  examples  of  such 
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FOV  criteria  are  residual  phase  error  levels  less  than  A/7  or  A/8.  Figure  4.12  shows  that,  for  the 
2-layer  atmosphere  studied  in  Chapter  IV,  a  2-DM  system  increases  the  compensated  area  by  a 
factor  of  4  for  either  of  these  FOV'  criteria.  If  d  =  tq  =  0.2  m.  these  results  show  that  a  1.2  meter 
2-DM  adaptive  optical  telescope  with  4  sodium-layer  guide  stars  can  compensate  a  square  FOV  90 
microradians  (18  arcsec)  on  a  side  to  a  level  of  residual  phase  error  <  A/8,  vs.  45  microradians  (9 
arcsec)  for  the  single-DM  system.  If  the  FOV  criterion  is  A/7,  the  MCAO  system  can  compensate  a 
FOV  135  microradians  (28  arcsec)  vs.  70  microradians  (14  arcsec)  for  the  single-DM  system.  This 
value  of  ro  might  be  seen  under  excellent  observing  conditions  at  visible  wavelengths.  At  infrared 
wavelengths,  a  typical  value  of  tq  is  0.5  meters.  Hence,  the  plots  in  Chapter  IV  apply  to  a  3  meter 
telescope  imaging  in  the  infrared  and  the  compensated  areas  increase  accordingly. 

Continuing  this  example,  the  importance  of  laser  guide  star  tilt  blindness  may  be  illustrated. 
According  to  Figure  4.13,  if  laser  guide  stars  did  not  have  this  limitation,  the  2-DM  telescope 
could  compensate  the  larger  area  (i.e.  135  microradians  on  a  side)  to  A/9  instead  of  A/7.  This 
corresponds  to  an  increase  in  the  Strehl  ratio  from  0.45  to  0.61. 

5.2.1  Recommendations  for  Future  Work  .Several  areas  remain  to  be  investigated  in  the 
study  of  MCAO  systems. 

5.2.2  Wave  Optics  Diffraction  alters  the  phase  aberration  function  in  the  telescope  aperture 
due  to  turbulence  at  a  distance.  Also,  propagation  over  a  significant  distance  causes  some  of  the 
phase  effects  to  convert  to  amplitude  effects  (scintillation).  An  interesting  question  remaining  to 
be  answered  is  whether  conjugating  the  DMs  in  an  adaptive  telescope  to  correction  planes  close  to 
the  turbulence  can  reduce  scintillation  in  the  telescope  aperture. 

5.2.2  Temporal  Effects  The  refractive  index  of  the  atmosphere  is  a  temporal  as  well  as  a 
spatial  random  process.  Since  a  finite  amount  of  time  elapses  between  wavefront  measurement  and 
correction,  degradation  in  the  level  of  compensation  results.  In  a  multiple  guide  star  system  that 
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collects  the  WFS  measurements  sequentially,  the  different  guide  star  measurements  in  the  ensemble 
will  be  slightly  different  due  to  the  time  delay  between  measurements. 

5.2.4  Closed  Loop  Control  Closed  loop  control  has  been  successfully  achieved  for  single- 
DM,  single-guide-star  adaptive  optics  systems.  Closed  loop  control  of  multiple-DM  systems  could 
be  troublesome  due  to  the  optical  coupling  among  the  DMs. 

5.2.5  Wtde  Field  Tilt  Compensation  The  results  of  this  dissertation  show  if  we  are  to  ap¬ 
proach  the  full  theoretical  potential  of  MCAO  systems,  a  means  of  sensing  the  overall  tilt  component 
from  a  variety  of  directions  in  the  FOV  is  required. 


5-3 


Appendix  A.  Derivation  of  Equations  in  Chapter  III 


A.  I  Expected  Value  of  Product  of  Convolutions 

In  the  following  section,  the  expected  value  of  the  product  of  two-dimensional  convolutions  is 
needed.  In  the  subsequent  derivation,  all  random  processes  are  assumed  to  be  wide  sense  stationary. 


([/i(ri-i/i)  *  *9i(xi,yi)]  [Mr2,y2)  *  *92(x-j,y2)]) 


(iV2(fi(xi  -6,1/1  -»?i)/2(a;2  -6.J/2  -  r}2))9i{C<'ni)92{^2,r]2) 
^hhDi  -^1  -  -^2  +  6,yi  -  »?i  - 1/2  +  q2) 


Xtfi{^i,r?i)s2(6,»72) 


(A.l) 


Letting  Ax  =  Xi  —  X2  and  Ay  =  j/i  —  j/2. 


<[/l  (*  1 , 1/1 )  *  *» l(*l ,  yi )]  [f2{X2, 1/2)  ♦  *92{X2.  1/2)]) 

=  y<^»72r/,/3(Ai-f  6 -6.^y  +  »?2  -  »7i)</i(6.»7i)i/2(6.»?2) 

=  y  yd7/2[r/,/j(Ax  +  6.Ay-t-T/2)**yi(Ax-f6,Ay+?72)]y2(6.'72) 

=  r/,/,(Ax,  Ay)  ♦  *yi(Ax,  Ay)*  ♦y2(-Ax,-Ay)  (A.2) 


A. 2  Equation  (3.27) 

In  this  section  the  autocorrelation  function  of  the  wavefront  component  estimate,  (x ),  is 
derived.  The  development  begins  with  equation  (3.25),  which  is  the  expression  for  the  wavefront 
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component  estimate. 


/  M  r  i  ^ 

=  ^  J  )  ♦  ♦/lsa(^' +  ■?)  *  +  ■*") 

\m=:l  .*'  =  1 


M  r  I 


m'  =  l  Li"=l 

M  M  jX  I 

=  EE  E  V?i_,(x' +  f  +  +»)m(*'  +  X)  ♦♦/lsa(^  +  J")  *  +  ■?) 

mrrl  171^  =  1  \  .«'=! 


X  ‘Pi, II  +  ^\,„0m')  +  Irn'i^  )  *  *fls&(x' )  *  *hvi,^,(^ 


Using  equation  (A. 2)  from  the  preceding  section, 


r^,(x)  = 


M  M  jX  I 

^  yi./(^  +  X  +  zi,,^m)  +  >?m(^  4-  x) 

m=l  m'  =  l  \  Li'=l 

*  *  hsa,ix)  *  ♦/»8a(-ic)  ♦  *hm,„ix)  *  ♦/jw.„,(-*) 


' 

.  L»''=i 


(x*  +  +  Vm'i^  ) 


Using  the  mutual  independence  of  random  processes  <p\,{x)  and  ??m(*  )•  and  the  fact  that  convolu¬ 
tion  distributes  over  addition  (23): 


r^,(x)  = 


"  f  "  ^  r  -  -  1 

m=l  Vm'  =  l  i'  =  l 

-|-r,(x)  ♦  ♦/»sa(x  )  *  ♦/»8a(-*  )  *  **».».(*)  ♦ 

m=l  lm'=l  «'=t 

-f-r„(x )  ♦  *hw.„{x)  *  )|  *  */‘8a(* )  *  *f*s»{-x)  (A.i 


*  *Aw,„(x  )  ♦  */>w,„i(  x) 
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A. 3  Equation  (3.39) 


Substituting  equations  (3.31 )  and  (3.34)  into  equation  (3.30),  the  mean  square  error  expre.ssion 
becomes 


/•/sa  c/sa  1 

=  crl^+  /  dfrdfyl 

3  —  fs&  /sa  1 

r 

|hwi,  1”  +  l^wul*  +  Re  ^/*iihi2^w,,/iw,2| 

-t-Re  — 

+  ^t)  1'  +  |^W,2  i 

(A.6) 

Assumption  (3.32)  means  that 

h^2  ~  ^11 

(A.7) 

^22  =  ^21 

(A.8) 

« 

t-c 

II 

(A.9) 

With  these  substitutions,  the  integrand  of  equation  (A.6)  becomes 

[l^wii  1^  +  +  Re  +  Re  |h2jhw,,h*,j|  ~  j 

+  (A.IO) 

Assumption  (3.33)  means  that 

h2i  =  Mi  (A.ll) 

This  final  substitution  leads  to  equation  (3.39). 
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Appendix  B.  Derivation  of  Equations  in  Chapter  IV 


B.l  Equation  •'^4-9) 

The  derivation  of  the  first  term  of  equation  4.{i  is  shown  for  jr-directed  slopes.  The  procedure 
is  exactly  the  same  for  the  j/-slope.  The  z-directed  slope  from  the  mth  guide  star  measured  in  the 
nth  subaperture  is  the  gradient  of  the  total  phase  in  the  z-direction  averaged  over  the  subaperture; 


r  f 

«/— oo  •/— oo 


dx  Wn(x,y) 


Using  integration  by  parts  on  the  z  integral  with 


equation  (B.l)  becomes 


_  dWn{x,y) 


/CO  r  oo  .Qo 

dy  <Wn{x,y)<pt^{x,y)  -  dx 

■OO  I  — OO  J  —  OO 


5VU„(z,y) 


The  first  term  of  the  preceding  equation  is  0  since  the  subaperture  has  finite  size,  which  implies 
lT„(z,j/)  =  0  at  infinity.  Thus, 


/•oc  p  dWnix,y) 


j  dWn(x,y)  ,  , 

dx — — v>t„(a;,y) 
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B.2  Equation  (4-19) 


Substituting  equation  (4.16)  into  the  first  term  of  equation  (4.18), 

j  d-xW^^{x)(^4>\^(x)^  =  J  (Px\V^^(x){MkS  ■rt{x)MkS  Tkix)) 

=  J  d-xW^^{x){Tk(x)  •  MtsMfcS  •  rjt(f )) 

~  )(rJ(^)Mtss'^Mjrife(3;)) 

=  J  d‘^xW^^{x)Tj{x)Mk  (ss'^)MjTk(x) 

=  j  d-xW^^(x)rJ(x)MkJ:„Mlrk{S)  (B.4) 

For  two  vectors  b  and  c  having  the  same  number  of  elements  (6): 

c^b  =  tr  [bc^]  (B.5) 

Letting 

b 

o'" 

equation  (B.4)  becomes 

J  d^xlVi,(x)(^l(x))  =  Jd^xWi^(x)tT[^„Mjrk{x)Tj{x)Mk] 

=  tr[E„MjRtMt] 

=  tr  [MjRtMiS,,]  (B.6) 

since  for  any  square  matrix  B 

tr[B]  =  tr[B^]  (B.7) 
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=  E„M|rt(x) 
=  rT(x)Mt 


In  the  final  step  of  equation  (B.6),  the  symmetry  of  the  S,,  and  R/t  matrices  was  used. 


B.  i  Equation  (4-23) 

Substituting  equation  (4.16)  into  the  third  term  of  equation  (4.18), 

J  d-xWd,{x)(^^d„ix)’t>d,(^))  -  J  d'^x\\\(x){MkS  ■Tk(x)(i)d^(x)) 

=  j  d~xWd^{x){s'^Mjrk(x)<j)d,(x)) 

-  J  d-xWd^(x){tt[rk(x)<i>d^(x)s^Mj]) 

=  j  <^'^B';iJx)tr  [rfc(£)((|id^  (f)s'^)Mj] 

=  tr  [AfcMj]  (B.8) 

If  a  matrix  B  is  of  order  (i  x  j)  and  a  matrix  C  is  of  order  (j  x  i)  then  (6)-. 

tr[BC]  =  tr[CB]  (B.9) 

Hence,  equation  (B.8)  may  be  written  in  several  forms: 

J  d-xWd^(x)(^^dA^)<i>d^(x)^  =  tr[MjAifc] 

=  tr[AjMifc]  (B.IO) 

B.4  Equation  (4-^d) 

The  derivation  of  the  minimizing  control  matrix  starts  with  equation  (4.25)  in  its  summation 

form: 

(ft)  =  (fo^) + E  E  E  L  -  2  E  E  (Bii) 

j  j'  n  n'  jo 
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The  minimizing  control  matrix  is  found  by  setting  the  first  derivative  of  the  preceding  equation, 
with  respect  to  an  arbitrary  element  of  the  matrix,  equal  to  zero.  Consider  element  Mki- 

i^l)  =  ((qi,)  +  +  -2AklMkl 

j'  n‘  j  n 

+[terms  not  multiplied  by  A/ti]  (B.12) 

The  derivative  with  respect  to  Mki  is 

+  ^  -2Tit,  (B.13) 

j’  n*  j  n 

Setting  the  preceding  equation  equal  to  zero  and  writing  in  matrix  form  results  in 


RMS  =  * 


(B.14) 


Solving  for  M* : 


M*  =R-‘AE-* 


(B.15) 


To  show  that  this  stationary  point  is  a  minimum,  the  second  derivative  may  be  examined; 


fMli) 

dMl 


=  2Rkk^u  >  0 


fB.16) 


Since  equation  (4.25)  is  quadratic  in  Mj,  it  is  a  global  minimum. 


B.5  Equation  (4-33) 


Substituting  equation  (4.16)  into  equation  (4.29)  (with  the  tilt  correction  term  dropped)  and 
removing  the  piston  in  the  wavefront  estimate  yields 


'  -  r  -  /•  - 1 

f(x,0o)  =  5Z'Pi.(^+ ^i.^o)  -  ^  Mks  rk(x-i-z^Jo)-  d'^xVVa(x)Mks  rk{x  +  z^Jo) 

i  =  l  k  =  l 


K 


=  rt(f+Cd^C)-rj:(0o) 


1  =  1 


ib  =  l 


(B.17) 


Substituting  the  preceding  equation  into  equat.on  (4.30)  and  following  the  same  procedure  as  in 
Sections  B.2  and  B.3: 


{(^{0o))  =  4(00)  +  J  d'^iw^ix) 

K  K 

^  5m  -  rt(^>)]  Mfc-s  •  Vk'ix  +  Zi^,0o)  -  f*-(eo)]) 


*=1  *'=1 


/I  K 

d'^xW^ix)'^  ^  ^M*s  •  [r<.(f  +  z^Jo)  -  rt(^o)]  +  ^i/o)) 

=  J  d^xW^(x)tTi^'S„Mj^rk{x  +  ZiJo)-rk(0o)\  [rt<(x  +  Zd*,^o)  -  ft'(^o)]  M*| 

//  K 

H  -  flt(^o)]  (‘Puix  +  :i/o)s'^)  Mfc  I 

»=!  k=l 

(B.18) 


Defining  the  R**'(0o)  matrix: 


R-]tib'(^o)  = 


j  d^xW^(x)  r*(x  +  Xd*^o)-rt(^o)]  [rfc'(^+ -  rt.(^o)] 

=  J  d'^xWa{x)rk(x  +  ZdJo)rJ.ix  +  2d»/o)  -  2fk(0o)fJ,{0o)  +  Tk(0o)Tj.{0o) 

=  J  d^xWi,{x)Tk(x  +  Zd,0o)rk'{x  +  Zd^,0o)-Tk{0o)i  J'{0o)  (B.19) 


B-5 


B.6  Equation  (4-35) 


Defining  the  Aikido)  matrix  from  the  third  term  of  equation  (B.18); 

A,*(^o)  =  y  [ri(?+rd*tfo)-ri(^o)]  (v?i,('?+ -i.^o)s^) 

=  J  d-xWi,{x)Tk{x  +  Jo)  (^‘P\,ix  + 

-  J  d~xW^{x)vk{6o)(-M^+  (B.20) 

The  second  term  of  the  preceding  equation  is  equal  to  0  because  of  the  definition  of  hence 

Aifc(^o)  =  J  d‘^xWe,{x)Tk{x  +  ZdJo)(j\,{x  +  (B.21) 
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Appendix  C.  Additional  Results 


In  this  appendix,  some  additional  computational  results  tising  the  analysis  of  Chapter  IV  are 
presented  and  discussed.  Unless  otherwise  indicated,  the  geometry  of  Figure  4.6  and  the  parameters 
in  Table  4.1  w’ere  used  in  the  computations. 

C.  I  Photon  Noise 

Figure  C.l  shows  that  the  residual  phase  error  in  the  object  wavefront  decreases  asymptoti¬ 
cally  as  the  light  level  is  increased.  With  a  larger  number  of  photons  per  subaperture,  the  WFS  is 
able  to  obtain  a  better  estimate  of  the  centroid  of  the  laser  guide  star  image  in  each  subaperture. 
This  improved  centroid  estimate  results  in  more  accurate  slope  measurements.  The  values  for  Nph 
in  the  figure  are  based  on  the  photon-limited  noise  model  presented  in  Chapter  IV.  Higher  light 
levels  are  required  to  obtain  similar  results  if  readout  noise  is  significant. 

C.2  Guide  Star  Separation 

The  designer  of  an  MCAO  system  may  also  be  interested  in  the  answer  to  the  following 
question:  Given  the  available  resources  (number  of  guide  stars  and  DMs)  and  a  known  atmospheric 
turbulence  profile,  how  large  a  field  of  view  may  be  compensated  while  maintaining  a  specified  level 
of  residual  phcise  error?  Figure  C.2  addresses  this  question  for  the  MCAO  system  described  by 
Figure  4,6  and  Table  4.1.  The  figure  plots  the  maximum  level  of  residual  phase  error  in  the  FOV 
vs.  guide  star  separation.  If  the  meiximum  allowed  RMS  error  for  any  point  in  the  FOV  is  A/7, 
then  the  separation  between  the  guide  stars  is  limited  to  about  9  times  the  aperture  dimension  for 
this  2-layer  atmospheric  profile. 
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Figure  C.l. 


RMS  residual  phaise  error  averaged  over  object  wavefront  vs.  object  position  for 
2-DM  adaptive  telescope  with  4  guide  stars  with  light  level  eis  a  parameter. 
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C.3  Guide  Star  Projection  Aperture 


Figure  4.13  shows  how  the  performance  of  an  MCAO  system  is  limited  due  to  the  inability 
to  mecisure  overall  tilt  using  laser  guide  stars.  However,  suppose  the  telescope  aperture  we  wish  to 
compensate  is  smaller  than  the  laser  guide  star  projection  aperture.  As  shown  in  Figure  C.3,  the 
overall  tilt  could  be  measured  across  the  telescope  aperture,  since  it  is  seen  as  part  of  a  higher-order 
component  from  the  point  of  view  of  the  projection  aperture.  This  point  is  illustrated  quantitatively 
in  Figure  C.4.  which  shows  how  the  performance  of  the  MCAO  system  improves  as  the  laser  guide 
star  projection  aperture  incretises  in  size.  The  curves  labeled  Dpf  D  —  1  and  Dp/D  —  oc  correspond 
to  the  two  curves  in  Figure  4.13. 

C.4  Upper  DM  Actuator  Density 

In  the  results  shown  in  Chapter  IV,  the  same  interactuator  spacing  was  used  for  both  DMs. 
Because  of  the  larger  size  of  the  upper  DM,  as  shown  in  Figure  4.9,  it  requires  an  array  of  100 
actuators  vs.  49  for  the  lower  DM  when  the  .same  interactuator  spacing  is  used.  However,  the  upper 
DM  is  compensating  for  the  weaker  of  the  two  turbulent  layers.  Since  the  upper  layer  accounts  for 
only  10%  of  the  total  turbulence  strength,  it  has  very  little  power  at  high  spatial  frequencies.  Thus, 
the  DM  assigned  to  this  layer  should  be  able  to  effectively  compensate  for  the  aberrations  due  to 
this  layer  with  fewer  actuators.  In  other  words,  since  the  upper  DM  reconstructs  lower  spatial 
frequency  surfaces  than  the  lower  DM,  it  can  allow  a  wider  interactuator  spacing.  In  Figure  C.5, 
the  interactuator  spacing  was  increased  on  the  upper  DM  to  2  x  ro  (25  actuators),  3  x  ro  (16 
actuators),  and  4  x  ro  (9  actuators).  The  figure  shows  that  performance  initially  improves  as  the 
interactuator  spacing  increases.  It  is  actually  detrimental  to  use  too  many  actuators,  since  good 
information  about  the  high  spatial  frequencies  achievable  by  a  dense  actuator  array  is  not  available 
for  the  weak  turbulent  layer.  As  the  number  of  actuators  is  further  decreased  toward  the  limit  of 
0,  the  performance  curve  in  Figure  C.5  begins  to  tip  up  toward  the  single-DM  performance  curve. 
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Figure  C.3. 


The  overall  tilt  component  for  a  small  aperture  is  part  of  a  b'gher-order  component 
for  a  large  aperture. 
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Object  position,  z^%ld 


Figure  C.4. 


RMS  residual  phase  error  averaged  over  object  wavefront  vs.  object  position  for  2- 
DM  adaptive  telescope  with  4  guide  stars  with  projection  aperture  dimension  as  a 
parameter. 


RMS  error  (rad) 
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